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INTRODUCTION: 


Neurofibromatosis  type  2  (NF2)  is  a  hereditary  disorder 

characterized  by  the  development  of  bilateral  vestibular  schwannomas 
and  is  associated  with  mutations  in  the  tumor  suppressor  gene  called 
the  neurofibromatosis  type  2  (NF2)  gene  (Chang  et  al . ,  2005;  Neff  et 
al . ,  2006) .  The  NF2  gene  encodes  a  protein  named  Merlin  for  moesin- 
ezrin-radixin  ]^ike  prote^  (Trofatter  et  al .  ,  1993)  Merlin  shares  a 
great  deal  of  homology  with  the  ezrin,  radixin,  and  moesin  (ERM) 
proteins,  which  belong  to  the  protein  4.1  superfamily  of 
cytoskeleton-associated  proteins  that  link  cell  surface  glycoproteins 
to  the  actin  cytoskeleton .  Presently,  the  mechanism  by  which  Merlin 
functions  as  a  tumor  suppressor  is  poorly  understood. 

Drosophila  melanogaster  provides  a  genetic  and  developmental 
system,  which  is  amenable  to  experimental  manipulation,  and  has  been 
very  valuable  to  the  study  of  tumor  genetics.  The  Drosophila  homolog 
of  Merlin  shares  sequence  similarity  to  the  human  Merlin  protein 
(McCartney  and  Fehon,  1996;  Fehon  et  al . ,  1997) .  In  addition,  the 
human  NF2  gene  could  rescue  the  lethal  Merlin  mutant  allele  in 
Drosophila,  implying  a  functional  conservation  (LaJeunesse  et  al . , 
1998) .  Molecular  genetic  analysis  reveals  that  Merlin  is  essential 
for  regulation  of  proliferation  and  differentiation  in  the  imaginal 
disc.  However,  understanding  the  tumor-suppressor  function  of  Merlin 
requires  additional  knowledge  about  specific  cell-cycle  points  where 
Merlin  regulates  proliferation  and  coordinates  it  with  morphogenesis. 

We  have  found  that  cells  in  the  wing  imaginal  disc  from  the  fly 
larva  with  Merlin  mutation  (Mer'^ )  displayed  abnormalities  in  the 
control  of  mitosis  exit.  Cytological  images  of  mutant  cells 
frequently  showed  asynchronous  anaphase  and  telophase.  We  have  also 
isolated  adult  Mer'^  pharates.  Interestingly,  these  Mer  mutant  adults 
showed  abnormal  leg  morphology.  Some  of  them  displayed  duplication  of 
the  wing  disc,  and  in  some  cases,  abnormalities  were  seen  in  the 
dorsal/ventral  compartment  border  of  the  Mer  mutant  wing  disc.  These 
results  suggest  that  Merlin  is  important  not  only  for  the  control  of 
mitosis  exit  but  also  for  the  determination/maintenance  of  morphogen 
gradients  in  the  wing  imaginal  disc. 

The  goal  of  our  proposed  research  is  to  examine  the  novel  role  of 
Merlin  in  the  control  of  mitosis  and  development.  Specifically,  we 
plan  to  confirm  the  role  of  Merlin  in  the  control  of  mitosis  and 
determine  whether  there  are  any  additional  points  in  the  cell  cycle 
where  Merlin  executes  its  activity.  We  will  examine  the  role  of 
Merlin  in  wing  imaginal  disc  development  and  the  effect  of  Merlin 
mutation  on  specific  regulatory  protein  expression  within  the  wing 
imaginal  disc.  In  addition,  we  will  attempt  to  investigate  whether 
the  abnormalities  in  mitosis  observed  in  Merlin  mutant  fly  can  also 
be  seen  in  mouse  and  human  schwannoma  cells  lacking  NF2  function. 

From  this  study,  we  hope  to  a  better  understanding  of  how  Merlin 
executes  regulation  of  proliferation  and  how  it  coordinates 
proliferation,  mitosis,  and  morphogenesis.  Future  investigation  of 
the  signaling  pathways  that  link  Merlin  to  intracellular  signals 
regulating  cell  division  may  enable  designs  for  novel  therapeutic 
regiments  to  cure  NF2  schwannomas  and  other  associated  tumors. 
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BODY: 


Aim  1 :  To  conduct  cytological  analysis  on  additional  Merlin  mutant 
alleles  and  allelic  combinations  for  the  control  of  mitosis  exit  and 
morphogenesis 

Task  1:  Accomplished  in  Year  One. 

Task  2:  We  have  completed  the  cytological  analysis  on  tissues 
isolated  from  three  wild-type  strains,  various  Merlin  mutants  [Mer^ 
(Gin324^stop)  ^  (G^^3i8^stop)  ^  ^^^3  (Me t )  ,  and  Mer^  (Gln^"°^"^°P)  ]  , 

as  well  as  the  Mer'^ ;Mer^  strain  that  contains  a  wild-type  Merlin 
transgene  in  the  Mer4  background.  We  observed  that  Merlin  mutants 
displayed  two  types  of  mitosis  exit  asynchrony  more  frequently  than 
the  three  wild-types  strains.  The  first  is  the  asynchronous 
anaphase-telophase  figures  with  one  sister  chromosome  set  in  anaphase 
and  the  other  in  telophase  (Figure  lA-B)  and  the  second  is  the 
asynchronous  telophase-interphase  figures  with  one  sister  chromosome 
set  in  telophase  and  the  other  in  interphase  (Figure  IC-D) . 

Figure  1.  Images  of  normal  mitosis  In  neural  brain  cells  from  the  third- 
Instar  wild-type  larvae  (A-F)  as  compared  with  those  of  asynchronous  mitosis 
exit  In  neural  brain  cells  from  the  Mer'^  mutant  larvae.  During  normal 
mitosis ,  chromosome  condensation  at  heterochromatin  regions  begins  at  early 
prophase  PI  (A) .  Condensed  homologous  chromosomes  are  paired  at  late 
prophase  P2  (B) .  Sister  chromatids  are  connected  at  precentromerlc 
heterochromatin  regions  In  metaphase  (C) .  Chromosomes  are  separated  along 
their  lengths  at  late  metaphase  (D)  .  Two  sister  chromosome  sets  are 
separating  from  each  other  and  oriented  toward  the  opposite  poles  In 
anaphase  (E) .  Fully  separated  sister  chromosome  sets  begin  to  undergo 
chromosome  decondensation  In  telophase  (F)  .  Unlike  normal  mitosis ,  the  Mer^ 
mutant  cells  frequently  display  asynchronous  anaphase-telophase  figures. 
While  two  sister  chromosome  sets  are  separating  from  each  other  and  oriented 
toward  the  opposite  poles,  one  set  of  chromosomes  already  undergoes 
chromosome  decondensation  (G,H)  .  The  Mer'*  mutant  cells  also  frequently  show 
asynchronous  telophase-lnterphase  figures  (I,J). 


A 

• 

B 

> 

4 

C 

• 

V 

D 

0 

* 

E 

% 

F 

G 

■  t 

H 

^  *  m 

I 

• 

J 

■  1  ' 

* 

5 


As  shown  in  Table  1,  in  contrast  to  those  from  the  three  wild-type 
strains,  neural  brain  cells  from  all  four  Merlin  mutants  showed  both 
types  of  mitosis  exit  asynchrony  at  higher  frequencies.  The 
frequency  of  asynchronous  figures  was  lower  in  the  weak  Mer^  (missense 
mutation)  allele  than  the  other  three  truncated  Merlin  alleles,  Mer^ , 
Mer^ ,  and  Mer^ .  Importantly,  when  a  wild-type  Merlin  transgene  was 
introduced  back  to  the  Mer'^  genetic  background,  the  asynchronous 
mitosis  exit  phenotype  was  substantially  diminished.  These  results 
suggest  that  Merlin  is  important  for  the  control  of  mitosis  exit. 


Table  1.  Mitotic  exit  asychrony  in  various  Merlin  alleles,  compared  with  the  wild-type  Lausenne, 
Hikkone  A-W,  and  Oregon  R  strains. _ _ _ _ _ 


Strain 

No.  of  cells  in 

anaphase 

analyzed^ 

No.  of  cells 
with 

asynchronous 

anaphase- 

telophase 

figures'’ 

%  of  cells  in 

anaphase  with 

asynchronous 

anaphase- 

telophase 

figures 

No.  of  cells  in 

telophase 

analyzed^ 

No.  of  cells 
with 

asynchronous 

telophase- 

interphase 

figures*' 

%  of  cells  in 

telophase 

with 

asynchronous 

telophase- 

interphase 

figures 

Lausenne 

298 

0 

0 

11 

0 

0 

Hikkone  A-W 

93 

0 

0 

14 

0 

0 

Oregon  R 

101 

0 

0 

26 

0 

0 

Me/ 

118 

14 

11.9 

40 

16 

40 

Me/;Mer^ 

257 

7 

2.7 

18 

1 

5.6 

Me/ 

161 

16 

9.9 

29 

5 

17.2 

Me/ 

80 

14 

17.5 

10 

3 

30 

Me/ 

38 

5 

13.2 

7 

2 

28.6 

“Cells  with  at  least  one  sister  ehromosome  set  in  anaphase 

*’One  sister  ehromosome  set  in  anaphase  while  the  other  in  telophase. 

“Cells  with  at  least  one  sister  ehromosome  set  in  telophase  but  none  in  anaphase 
‘*One  sister  ehromosome  set  in  telophase  while  the  other  in  interphase. 

Task  3:  The  task  was  proposed  for  the  3^“^  year  study. 


Aim  2 :  To  examine  and  compare  the  duration  of  the  cell  cycle  and 
mitosis  phases  using  various  Merlin  mutants  and  to  study  subcellular 
localization  of  Merlin  at  various  phases  of  mitosis 

Task  4:  This  task  was  completed  in  year  one.  Our  results  indicate 
that  the  G2  phase  of  Mer^  cells  is  about  two  times  shorter  than  that 
of  wild-type  cells. 

Task  5:  We  have  made  an  attempt  to  cultivate  the  Mer'^  and  wild-type 
wing  imaginal  discs  using  the  Robb' s  complete  tissue  culture  medium 
(Ashburner,  1989) .  We  found  that  while  we  were  able  to  cultivate 
wing  imaginal  discs  in  this  medium  for  about  2-3  hr,  cultivation  for 
a  longer  period  led  to  cell  detachment  from  the  discs.  Subsequently, 
we  used  the  Schneider's  tissue  culture  medium  (Sigma  Chemicals  Co.), 
and  we  checked  the  metabolic  activity  of  the  discs  by  BrdU 
incorporation,  followed  by  anti-BrdU  antibody  staining.  With  this 
approach,  we  were  able  to  cultivate  the  discs  for  10-13  hr. 

We  have  performed  a  double  labeling  experiment  with  BrdU  and  anti- 
phospho-histone  H3  antibody  on  the  cultured  wing  imaginal  disc.  BrdU 
incorporation  allowed  the  detection  of  S-phase  cells,  while  anti- 
phospho-histone  H3  antibody  labeled  cells  in  mitosis.  Figure  2 
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displays  an  area  of  the  double  labeled  wing  imaginal  disc.  The  green 
label  illustrates  the  BrdU  incorporated  cells,  which  represented  the 
cells  at  the  S  phase.  The  red  label  represents  the  cells  at  various 
stages  of  mitosis  as  determined  by  chromatin  morphology.  We  are  in 
the  process  of  culturing  BrdU-labeled  cells  for  various  times  and 
then  staining  them  with  anti-phospho-histone  H3  antibody.  By 
counting  the  number  of  doubly  labeled  cells  from  different  time 
points,  we  hope  to  determine  the  durations  of  various  cell  cycle 
phases . 


Figure  2.  Double  labeling  of  the  wing  imaginal  disc  with  BrdU  and  anti- 
phospho-histone  H3  antibody  allows  examination  of  cells  at  various  stages 
the  cell  cycle.  Examples  of  cells  at  the  S  phase  (S) ,  metaphase  (M)  , 
anaphase  (A)  ,  and  telophase  (T)  were  indicated. 
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Task  6:  As  indicated  in  the  year-One  report,  we  found  that  the 
overall  cell-cycle  duration  in  the  wing  disc  was  not  significantly 
affected  by  Merlin  mutation.  To  study  the  effect  of  Merlin  mutation 
on  cell  proliferation  in  different  regions  of  the  wing  imaginal  disc, 
mosaic  clones  of  multiple  wing  hairs  (mwh)  were  induced  in  the  +/mwh 
larvae  with  or  without  Mer^  mutation  at  96  hr  after  egg  laying  (AEL) 
by  lOOOR  of  y-rays  irradiation.  Irradiated  larvae  were  grown  to  the 
adult  stage  and  male  wings  were  removed.  The  mwh/mwh  clone  spots  in 
the  wings  were  photographed.  Clone  spots  from  70  Mer^  and  76  wild- 
type  wings  were  projected  onto  the  map  of  an  adult  wing.  Figure  3 
illustrates  the  distribution  of  mosaic  mwh  clones  in  the  wild-type 
and  Mer^  adult  wings.  Consistent  with  the  previous  finding  (Gonzalez- 
Gaitan  et  al . ,  1994),  the  wild-type  mosaic  clones  induced  at  such  a 
late  developmental  stage  (96  hr  AEL)  respected  the  vein  restriction 
and  resided  within  the  boundary  of  veins.  However,  the  Mer^  mutant 
clones  did  not  follow  such  a  rule  and  some  of  them  crossed  the  vein 
boundary  (arrows  in  Figure  3  point  to  these  clones) .  In  addition  to 
crossing  the  vein,  the  mosaic  clones  that  abut  on  the  vein  were  also 
more  frequently  found  in  the  Mer^  mutant  wing  than  those  in  the  wild- 
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type  control  (Table  1) .  These  results  further  support  the  role  of 
Merlin  in  cell  motility,  cell  adhesion  and  cell  proliferation.  Cell 
lacking  Merlin  may  possess  greater  ability  to  overcome  vein 
restriction . 

Figure  3.  Mer^  mutation  delimits  the  normal  clonal  restriction  of  mosaic 
clones  to  cross  veins.  Arrows  point  to  the  mer^  mutant  clones  that  cross  the 
vein  boundary .  Colors  distinguish  clone  overlaps  with  the  brown  color  for 
two  overlapping  clones,  violet  for  three  overlapping  clones,  and  blue  for 
four  overlapping  clones.  The  red  color  denotes  uni-cell  clones  and  the 
green  color  indicates  two-cell  clones. 


wild  type  Mer^ 


Table  1.  Vein  boundary  restriction  of  mosaic  clones  in  the  wild-type  and  Me/  genetic  background. 
Mosaic  clones  induced  in  each  wing  were  carefully  examined  on  an  enlarged  images.  Data  were 
collected  from  the  analysis  of  70  Me/  and  76  wild-type  wings. _ _ 


Genetic  background 
of  mosaic  clones 

No.  of  clones 
abutting  on  the 
wing  vein 

No.  of  clones 
crossing  the 
vein 

Total  No.  of 

clones 

analyzed 

Mer-^ 

16 

6 

240 

Wild  type 

6 

0 

193 

We  also  induced  mosaic  clones  in  the  wild-type  and  Mer^  wings  late 
in  development  (48h  after  puparium  formation) .  Consistent  with 
previous  observation  showing  that  no  cell  proliferation  occurred  in 
the  wing  disc  at  this  time  (Garcia-Bellido  and  Merriam,  1971),  only 
three  uni-cell  clones  were  found  in  62  wild-type  wings  examined 
(Figure  4) .  In  contrast,  we  identified  26  mosaic  clones  in  47  Mer^ 
wings  analyzed;  among  them,  three  were  two-cell  clones  and  five 
others  contained  multiple  cells.  These  results  indicated  that 
significant  cell  proliferation  took  place  48  hr  after  puparium 
formation  in  the  Mer^  wings.  Our  results  further  argue  the 
possibility  for  a  temporal  difference  in  developmental  timing  between 
the  wild-type  and  Mer^  mutant  larvae. 


Figure  4.  Mosaic  clone  induction  in  the  wild-type  and  Mer^  wing  48h 
after  puparium  formation .  Each  cell  is  marked  by  a  color  dot.  Two-cell 
clones  are  indicated  in  green.  The  cells  in  multi-cell  clones  are  connected 
by  a  line.  The  picture  represents  the  summary  of  mosaic  clones  found  on  62 
wild-type  and  47  Mer^  wings. 


<  ■  v 


wild  type 


Mer^ 


Task  7:  We  acquired  a  small  aliquot  of  anti-Merlin  antibody  from  Dr. 
Rick  Fehon  at  the  University  of  Chicago  and  performed  an 
immunostaining  experiment.  As  reported  previously  (McCartney  and 
Fehon,  1996),  the  Merlin  protein  displayed  cortical  localization  and 
some  cytoplasmic  staining  throughout  the  wing  imaginal  disc  tissue 
(Figure  5A) .  Johnston  and  Edgar  (1998)  previously  showed  that  the 
Wingless  (Wg) -expressing  cells  at  the  dorsal/ventral  (D/V) 
compartment  border  of  the  wing  imaginal  disc  were  arrested  at  the  Gi 
phase  of  the  cell  cycle.  Furthermore,  in  the  anterior  compartment, 
the  stripe  of  Gl-arrested,  Wg-expressing  cells  at  the  D/V  border  was 
surrounded  by  the  two  stripes  of  G2-arrested  cells.  Consistently,  we 
detected  strong  Wg  staining  in  the  cells  at  the  D/V  compartment 
border  of  the  wing  imaginal  disc,  and  the  majority  of  Wg  protein 
staining  was  found  in  the  plasma  membrane  (Figure  5B) .  By 
superimposing  the  Merlin  and  Wg  staining  images  together,  we  found 
that  the  Merlin  and  Wg  proteins  appeared  to  be  colocalized  in  the 
cells  at  the  D/V  compartment  border  (Figure  5C) .  Together  with 
previous  findings,  these  results  indicate  that  the  majority  of  Merlin 
protein  is  localized  in  the  plasma  membrane  of  cells  in  both  the  Gl 
and  G2  phases.  The  colocalization  between  Merlin  and  the  Wg 
morphogen  further  suggest  a  role  of  Merlin  in  Wg  trafficking. 

Figure  5.  Colocalization  of  Merlin  with  the  Wg  morphogen  at  the 
D/V  compartment  horde.  Wing  imaginal  discs  from  the  3^^  instar  wild- 
type  larva  were  fixed  and  stained  for  Merlin  (green)  and  Wg  (red) 
using  the  guinea  pig  anti-Merlin  polyclonal  antibody  (McCartney  BM 
and  Fehon  RG,  1996)  and  the  anti-Wg  monoclonal  antibody  4D4  (Brook 
and  Cohen,  1994),  respectively.  (A)  The  Merlin  protein  was  localized 
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to  the  plasma  membrane  and  some  within  the  cytoplasm .  (B)  The  Wg 

protein  was  synthesized  in  the  cells  at  the  D/V  compartment  border. 
Within  this  stripe,  Wg  was  associated  with  the  plasma  membrane.  The 
Wg  protein  spread  from  the  stripe  of  expressing  cells  to  the  adjacent 
celles  in  a  gradient  fashion  by  vesicle  trafficking  (punctuate 
structures ) .  (C)  Merlin  and  Wg  were  colocalized  at  the  plasma 

membrane  of  cells  at  the  D/V  compartment  border. 


Aim  3 :  To  further  examine  the  role  of  Merlin  in  the 
determination/maintenance  of  the  D/V  compartment  border  in  the 

Drosophila  wing  imaginal  disc  and  to  investigate  how  Merlin  mutation 
affects  the  expression  of  proteins  important  for  the  determination  of 
the  compartment  border 

Task  8:  The  task  was  accomplished  in  year  one  and  the  results 
confirmed  that  Merlin  plays  an  important  role  in  the  determination  of 
the  wing  morphology. 

Task  9:  To  examine  the  role  of  Merlin  in  the  determination  of  the 
D/V  compartment  border  in  the  wing  imaginal  disc,  we  examined  the 
distribution  pattern  of  the  Wg  morphogen  in  the  wild-type  and  Mer^ 
mutant  wing  imaginal  discs.  It  is  well-documented  that  the  Wg 
protein  is  synthesized  by  the  cells  at  the  D/V  compartment  boundary 
and  then  forms  spatial  concentration  gradients  while  moving  away  from 
the  D/V  border  (Seto  and  Bellen,  2004) .  In  contrast  to  the  tight 
stripe  pattern  of  Wg  expression  in  the  wild-type  wing  disc  (Figure 
6A-B) ,  the  Wg  stripe  in  the  mer^  mutant  disc  was  frequently  more 
diffuse,  and  in  about  40%  of  the  cases  some  diffuse  regions  of  the  Wg 
stripe  were  of  particular  notable  (indicated  with  arrows  in  Figure 
6C-D) .  These  results  suggest  that  Merlin  inactivation  may  lead  to  an 
alteration  on  the  determination/maintenance  of  Wg  stripe  expression. 

Fehon  and  colleagues  previously  found  that  Merlin  and  Expanded, 
another  member  of  the  protein  4.1  superfamily,  were  associated  with 
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the  apical  junctional  region  in  imaginal  epithelia  and  with  endocytic 
vesicles  in  cultured  cells  (McCartney  and  Fehon,  1996;  McCartney  et 
al . ,  2000).  Together,  they  regulated  the  abundance,  localization, 
and  turnover  of  cell-surface  receptors.  Loss  of  these  proteins 
affected  abundance,  cell-surface  localization,  and  endocytic 
trafficking  of  Notch,  EGFR,  and  other  signaling  and  adhesion  receptors 
in  epithelial  cells.  As  mentioned  above,  we  found  that  the  Merlin 
protein  co-localized  with  the  Wg  protein  in  the  plasma  membrane  of 
cells  at  the  D/V  compartment  border  of  the  wing  imaginal  disc  (Figure 
5) .  We  also  observed  that  the  Mer^  mutant  displayed  a  slightly  more 
diffuse  Wg  stripe  at  the  D/V  compartment  border.  In  light  of  the  fact 
that  the  Wg  morphogen  is  distributed  through  the  tissues  by  vesicular 
trafficking  (Entchev  and  Gonzalez-Gaitan,  2002),  we  hypothesize  that 
Merlin  mutation  may  affect  this  process. 


Figure  6.  Wg  protein  distribution  in  the  wild- type  and  Mer4  mutant  wing 
imaginal  discs.  The  wing  imaginal  discs  were  stained  with  an  anti-Wg 
antibody .  The  Wg  stripe  (red)  was  readily  detected  at  the  D/V  compartment 
border  of  the  wild- type  wing  pouch  (A)  .  At  a  higher  magnification ,  the  Wg 
protein  showed  a  cortical  localization  in  the  cell  (B) .  However ,  the  Wg 
stripe  at  the  D/V  border  of  the  mer^  mutant  disc  appeared  more  diffuse  (C)  , 
particularly  in  some  regions  of  the  Wg  stripe  (arrows) .  An  enlarged  area  of 
a  diffused  region  (arrowhead  in  C)  showed  that  while  the  cortical 
localization  of  the  Wg  protein  was  still  seen,  Wg  granules  were  diffusely 
distributed  (D) . 


To  test  this  hypothesis,  we  performed  experiments  to  examine 
potential  genetic  interactions  between  Merlin  and  the  proteins 
involved  in  vesicular  trafficking.  We  first  searched  FlyBase  for 
available  strains  carrying  a  UAS  construct  with  a  gene  participating 
in  vesicular  trafficking,  or  having  an  EP-element  insertion  near  or 
in  the  genes  of  interest.  We  identified  fly  strains  carrying  the  UAS 
construct  for  sgl,  frc,  Rab5,  Rab7°^,  Csp,  or  as  well  as 

strains  with  an  EP-insertion  for  Damp,  garnet,  a-Adaptin ,  Scamp, 

Cirl,  Gdi ,  Rop,  lap,  or  AP-47.  We  used  the  wing-pouch  specific  Gal4 
driver  1096  to  ectopically  express  these  constructs  and  examined  any 
change  in  the  wing  morphology.  Among  the  strains  examined,  only 
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ectopic  expression  of  pore,  ,  a  dominant  negative  shi  allele 

(shi^^)  ,  and  lap  gave  rise  to  abnormal  wing  morphology. 

The  porcupine  (pore)  is  a  segment  polarity  gene  and  encodes  a 
putative  multi-pass  transmembrane  protein  belonging  to  the  membrane- 
bound  0-acyltransf erase  superfamily  (Hofmann,  2000)  .  Genetic  and 
immunocytochemical  studies  suggest  that  pore  is  required  for  the 
secretion  of  active  Wg  ligand.  The  Porcupine  (Pore)  protein 
stimulates  the  posttranslational  N-glycosylation  of  Wg  in  the 
endoplasmic  reticulum.  Ectopic  expression  of  Pore  stimulates  the  N- 
glycosylation  of  both  endogenously  and  exogenously  expressed  Wg 
(Tanaka  et  al . ,  2002) .  The  shibire  (shi)  encodes  a  dynamin  protein, 
a  GTPase  essential  for  cytokinesis  and  endocytosis  (van  der  Bliek  and 
Meyerowitz,  1991;  Kitamoto,  2002) .  The  Shibire  (Shi)  protein  is 
known  to  be  involved  in  vesicle  trafficking  of  various 
neurotransmitters  and  the  Wg  protein  during  early  embryogenesis 
(Strigini  and  Cohen,  2000)  .  The  lap  (like-AP180)  gene  encodes  a 
presynaptically  enriched  clathrin  adaptor  protein.  The  Lap  protein 
plays  an  important  role  in  clathrin-mediated  endocytosis  of  synaptic 
vesicles  (SVs)  and  regulates  the  size  of  SVs  (Zhang  et  al .  1998) . 

Figure  7.  Potential  genetic  interaction  between  Mer  and  pore.  (A) 
Overexpression  of  Mer'^^^  in  the  wing  pouch  using  the  1096  driver  did  not 
affect  the  overall  morphology  of  the  wing.  (B)  Overexpression  of  pore  in  the 
wing  pouch  completely  disrupted  the  wing  morphology .  (C)  Overexpression  of 

pore  together  with  Mer^^^  resulted  in  some  flies  with  normal  wing  morphology . 
(D)  Over-expression  of  pore  together  with  Mer^  completely  rescued  the 
overall  wing  morphology . 


Last  year,  we  briefly  reported  the  potential  genetic  interaction 
between  Mer  and  pore,  and  between  Mer  and  shi°^  (shi^^^^)  .  We  have 
extended  our  analysis  and  showed  that  over-expression  of  Mer^^^,  a 
dominant-negative  Merlin  mutant  with  a  deletion  of  the  Blue-Box  in 
the  FERM  domain  of  Merlin  (LaJeunesse  et  al .  ,  1998),  in  the  wing 
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pouch  using  the  1096  driver  resulted  in  loss  of  the  posterior  cross 
vein  and  a  slightly  enlarged  wing,  but  did  not  affect  the  overall 
wing  structure  (Figure  7A) .  In  contrast,  over-expression  of  pore  in 
the  wing  pouch  completely  disrupted  the  wing  structure,  yielding 
abnormally  small  wing  with  no  veins  (Figure  7B) .  Interestingly,  when 
pore  was  over-expressed  together  with  Mer'^^^  using  the  same  1096 
driver,  flies  with  normal  wing  morphology  were  recovered;  however, 
the  defect  in  the  posterior  cross  vein  was  still  present,  presumably 
due  to  the  effect  of  Mer^^^  over-expression  (Figure  1C) .  Similarly, 
when  pore  was  over-expressed  together  with  a  wild-type  Mer  (Mer^)  , 
flies  with  completely  normal  wing  morphology  including  cross  veins 
were  obtained  (Figure  7D) . 

Figure  8.  The  effect  of  over-expression  of  Mer'^^^  (A  and  E)  ,  pore  (B  and 
F)  ,  pore  plus  Mer'^^^  (C  and  G)  ,  and  pore  plus  Mer^  (D  and  H)  on  Wg  expression 
at  the  D/V  compartment  border.  (A-D)  Immunostaning  for  Merlin.  (E-H) 
Immunostaining  for  Wg.  White  arrowhead  show  the  Wg  stripe  of  expression  at 
D/V  compartment  border. 
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As  mentioned  above,  the  Wg  morphogen  normally  is  synthesized  in  the 
cells  at  the  D/V  compartment  border  of  the  wing  imaginal  disc.  Wg  is 
secreted  by  these  cells  and  spreads  over  the  entire  wing  disc  tissue 
in  a  gradient  fashion.  Wg  is  a  long-range  morphogen  and  its  gradient 
activates  target  genes  as  a  function  of  distance  from  the  source.  Wg 
is  essential  for  embryonic  development;  however,  strong  viable 
alleles  of  wg  have  been  isolated  and  could  result  in  flies  with  no 
wings.  We  examined  the  effect  of  overexpression  of  pore, 

Mer^^^  plus  pore,  and  pore  plus  Mer'^  in  the  wing  pouch  on  Wg 
expression.  Immunostaining  with  an  anti-Merlin  antibody  revealed 
that  the  1096  driver,  as  predicted,  was  active  in  the  wing  pouch, 
resulting  in  overexpression  of  the  Mer'^®®  and  the  wild-type  Merlin 
protein  (Figure  8A-D) .  When  Mer^^^  was  overexpressed  in  the  wing 
pouch,  no  obvious  effect  on  the  Wg  stripe  at  the  D/V  compartment 
border  was  seen  (Figure  8E) .  Intriguingly,  overexpression  of  Pore 
resulted  in  a  complete  loss  of  the  Wg  stripe  (Figure  8F) . 

Simultaneous  overexpression  of  Mer'^^^  or  Mer+  (the  wild-type  Merlin) 
with  Pore  led  to  re-appearance  of  the  Wg  stripe  at  the  D/V 
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compartment  border;  however,  a  broad  Wg  stripe  was  observed 
particularly  in  the  case  of  simultaneous  over-expression  of  Pore  and 
(Figures  8G  and  8H)  .  Together,  these  results  suggest  a 
potential  interaction  between  the  pore  and  Mer  genes.  Such  a  genetic 
interaction  has  a  profound  effect  on  the  determination  of  the  Wg 
stripe  at  the  D/V  compartment  border. 

Figure  9.  Potential  genetic  interaction  between  Mer  and  shi .  Over¬ 
expression  of  Mer^^^  in  the  wing  pouch  using  the  1096  driver  did  not  affect 
the  overall  morphology  of  the  wing.  Over-expression  of  shi^^^^  in  the  wing 
pouch  completely  disrupted  the  wing  morphology .  Over-expression  of  shi^^^^ 
together  with  Mer^^^  partially  restored  the  formation  of  staut  bristles  in 
the  wing  margin.  Simultaneous  over-expression  of  shi^^^^  and  Mer^  also 
partially  restored  the  formation  of  triple  rows  including  the  medial  triple 
row  of  staut  bristles  and  ventral  and  dorsal  triple  rows  of  sensory 
bristles . 
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We  have  also  extended  our  analysis  on  the  potential  genetic 
interaction  between  Mer  and  ,  a  dominant  negative  shi  allele. 

Overexpression  of  in  a  wing  pouch  using  the  1096  driver 

resulted  in  flies  with  abnormally  small  wings,  which  did  not  have  any 
veins  or  staut  bristles,  but  had  some  disorganized  sensory  bristles 
(Figure  9) .  Staut  bristles  originate  from  the  Wg-expressing  cells  at 
the  D/V  compartment  border  of  the  wing  imaginal  disc.  The 
disappearance  of  staut  bristles  may  imply  that  these  cells  did  not 
undergo  normal  differentiation  processes  during  wing  development. 
Interestingly,  when  was  over-expressed  together  with  in 

the  wing  disc  using  the  1096  driver,  a  partial  restoration  of  triple 
rows  including  the  medial  triple  row  of  staut  bristles  was  observed; 
however,  the  wing  remained  small  in  size  and  had  no  vein  (Figure  9) . 
Similarly,  simultaneous  over-expression  of  Mer^  and  in  the  wing 

pouch  partially  restored  the  formation  of  triple  rows  including  the 
medial  triple  row  of  staut  bristles  and  ventral  and  dorsal  triple 
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rows  of  sensory  bristles.  However,  the  spacing  and  organization  of 
ventral  and  dorsal  triple  rows  still  appeared  abnormal,  and  there 
were  no  veins  in  the  wing.  These  results  suggest  a  potential  genetic 
interaction  between  Merlin  and  shihire . 

Furthermore,  we  found  that  over-expression  of  lap,  which  encodes  a 
clathrin  adaptor  protein,  in  the  wing  pouch  using  the  1096  driver 
resulted  in  wings  with  ectopic  vein  formation,  frequently  seen  at  the 
distal  part  of  vein  V  (Figure  10) .  When  lap  was  over-expressed 
together  with  ,  excessive  ectopic  vein  materials,  which  was  even 

more  than  that  with  lap  over-expression  alone,  appeared  in  many  parts 
of  the  wing  blade.  In  contrast,  simultaneous  over-expression  of  Mer^ 
and  lap  in  the  wing  pouch  resulted  in  wings  with  normal  or  almost 
normal  vein  formation  (Figure  10) .  These  results  indicate  that  the 
dominant-negative  Merlin  Mer'^®®  enhances  the  effect  of  Lap  over¬ 
expression  on  the  vein  fate  determination,  while  the  wild-type  Merlin 
suppresses  it.  LeJeunesse  et  al .  (2001)  have  proposed  that  Merlin 

functions  to  antagonize  the  epidermal  growth  factor  receptor  (EGFR) 
pathway  which  control  vein  fate  determination  in  the  wing.  Maitra  et 
al .  (2006)  further  show  that  Merlin  and  Expanded  function 

cooperatively  to  modulate  receptor  endocytosis  and  signaling 
including  EGFR.  Taken  together,  our  data  suggest  that  Merlin 
counteracts  with  Lap,  which  is  important  for  receptor  endocytosis. 
Merlin  may  interact  with  Lap  directly  or  indirectly.  Through  Lap, 
Merlin  regulates  the  EGFR  pathway  required  for  vein  fate 
determination  in  the  wing  (Figure  11) . 

Figure  10.  Potential  genetic  interaction  between  Mer  and  lap.  Over¬ 
expression  of  Mer^^^  in  the  wing  pouch  using  the  1096  driver  did  not  affect 
the  overall  morphology  of  the  wing.  Over-expression  of  lap  in  the  wing 
pouch  completely  disrupted  the  wing  morphology .  Over-expression  of  lap 
together  with  Mer^^^  resulted  in  some  flies  with  normal  wing  morphology . 
Over-expression  of  lap  together  with  Mer^  completely  rescued  the  overall 
wing  morphology.  Black  arrowheads  denote  the  sites  of  ectopic  vein  material . 
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Figure  11.  Model  for  Merlin's  function  through  Lap  to  regulate  the  EGF 
pathway,  which  is  required  for  vein  fate  determination. 
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Figure  12.  The  region  of  Merlin  required  for  the  genetic  interaction 
with  Lap.  Arrowheads  point  to  ectopic  vein  materials . 


To  examine  which  region  of  Merlin  was  required  for  the  genetic 
interaction  with  Lap,  we  simultaneously  over-expressed  Lap  together 
with  various  truncated  Merlin  proteins  in  the  wing  pouch  using  the 
1096  driver.  When  Mer^“^®^  was  simultaneously  over-expressed  together 
with  Lap  in  the  wing  disc,  ectopic  vein  formation  was  still  detected 
in  the  wing,  similar  to  that  seen  in  the  case  of  Lap  over-expression 
alone  (Figure  12)  .  When  other  N-terminal  fragments  of  Merlin,  Mer^”^^° 
and  Mer^“^^^,  were  over-expressed  together  with  Lap,  ectopic  vein 
materials  continued  to  be  seen.  Also,  when  the  first  600  amino  acids 
of  Merlin,  Mer^"®°°,  was  co-expressed  with  Lap  in  the  wing  disc,  some 
ectopic  veins  were  still  present  in  the  wing;  however,  the  amount  of 
ectopic  vein  material  appeared  to  be  reduced.  Importantly,  when  the 
c-terminal  fragment  of  Merlin,  Mer^^^”®^^,  was  co-expressed  with  Lap, 
the  normal  wing  was  restored,  similar  to  what  was  observed  when  Mer^ 
was  simultaneously  over-expressed  together  with  Lap.  These  results 
indicate  that  the  C-terminal  region  of  the  Merlin  protein  is 
important  for  the  genetic  interaction  with  Lap.  Further  investigation 
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of  this  interaction  will  be  needed  to  better  understand  the  role  of 
Merlin  in  vesicular  trafficking. 

Because  the  Mer^  allele  was  viable  but  sterile  and  because  many 
proteins  involved  in  exocytosis/endocytosis  are  also  important  for 
spermatogenesis,  we  previously  examined  the  Mer^  mutant  for  any 
defects  in  this  process.  We  found  that  the  Mer^  mutant  displayed 
abnormalities  in  cyst  polarization  during  spermatogenesis.  The  Merlin 
protein  normally  localized  in  the  acrosome  of  mature  sperm,  but  this 
localization  was  altered  in  the  Mer^  mutant.  We  now  extended  our 
finding  to  the  Mer^  mutant.  We  found  that  the  adult  male  pharates 
carrying  the  Mer'^  allele  displayed  a  more  severe  defect  in  cyst 
polarization . 

We  presented  our  findings  on  the  Role  of  Drosophila  Merlin  in 
Spermatogenesis  and  Wg  Morphogen  Trafficking  in  the  Imaginal  Disc  to 
the  2006  CTF  International  Consortium  for  the  Molecular  Biology  of 
NFl,  NF2,  and  Schwannomatosis  (Omelyanchuk  et  al . ,  2006) . 

Task  10:  To  study  the  distribution  of  the  Merlin  protein  in  the  wing 
imaginal  disc,  we  are  in  the  process  of  preparing  additional 
antibody.  A  peptide  with  the  amino  acid  sequence  covering  the 
putative  phosphorylation  site  of  Drosophila  Merlin  was  synthesized, 
conjugated  to  a  carrier  protein,  and  used  in  immunizing  rabbits. 
Antibody  response  against  the  Merlin  peptide  will  be  tested  in  the 
near  future . 


Aim  4 :  To  investigate  whether  NF2~^~  mouse  schwannoma  cells  also  show 
cytological  abnormalities  in  mitosis  similar  to  those  seen  in  the 
Drosophila  imaginal  discs . 

Task  11:  This  task  was  accomplished  and  we  have  obtained  the 
j^f2fiox2/fiox2  POCre  mice  for  the  experiments  described  below. 

Task  12:  To  generate  a  conditional  Nf2  knockout  in  Schwann  cells,  the 
]^f2fiox2/fiox2  were  mated  with  the  POCre  mice.  Expression  of  the 

Cre  recombinase  in  Schwann  cells  (specified  by  the  myelin  protein  PO 
promoter)  would  result  in  deletion  of  exon  2  of  the  Nf2  gene 
(POCre ;  .  Giovannini  et  al .  (2000)  previously  showed  that 

conditional  inactivation  of  the  Nf2  gene  in  Schwann  cells  in  mice 
could  lead  to  Schwann  cell  hyperplasia  and  schwannoma  formation,  and 
the  tumors  usually  appeared  10  months  or  later  after  Nf2 
inactivation.  We  have  observed  that  the  POCre ;  mice  developed 

schwannomas  at  about  one  year  of  age.  Histopathological  examination 
revealed  that  the  tumor  was  a  grayish,  creamy  globoid  mass. 
Histologically  consisted  of  spindle-shaped  cells  with  mitotic  figures 
(Figure  13) .  We  are  in  the  process  of  harvesting  schwannoma  tumors 
for  cytological  analysis. 

Task  13:  We  have  also  bred  a  colony  of  (phenotypically 

normal)  and  (heterozygous  Nf2)  mice.  We  are  planning  to 

isolate  dorsal  root  ganglion  from  these  mice  for  the  isolation  of 
Schwann  cells.  Alternatively  we  will  harvest  sciatic  nerve  for 
Schwann  cell  preparation. 
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Figure  13.  Histological  analysis  of  a  schwannoma  produced  in  the 
POCre ;  mouse  with  a  deletion  of  Exon  2  of  the  Nf2  gene  in 

Schwann  cell.  Note  that  the  tumor  consisted  of  actively  growing, 
spindle-shaped  cells. 


Task  14:  We  will  conduct  the  cytological  analysis  of  schwannoma  cells 
and  Schwann  cells  in  year  3. 

Task  15:  During  the  past  year,  one  research  abstract  was  presented  to 
the  2006  CTF  International  Consortium  for  the  Molecular  Biology  of 
NFl,  NF2,  and  Schwannomatosis .  To  extend  our  knowledge  about 
potential  merlin  functions  in  various  species,  we  analyzed  the 
evolution,  diversity,  and  overall  distribution  of  merlin  among 
different  taxa.  Our  results  demonstrate  a  monophyletic  origin  of  the 
merlin  proteins  with  their  root  in  the  early  metazoa.  The  overall 
similarity  among  the  primary  and  secondary  structures  of  all  merlin 
proteins  and  the  conservation  of  several  functionally  important 
residues  suggest  a  universal  role  for  merlin  in  a  wide  range  of 
metazoa.  These  findings  were  published  recently  (Golovnina  et  al . , 
2005) . 


KEY  RESEARCH  ACCOMPLISHMENTS: 

(1)  Two  types  of  mitosis  exit  asynchrony,  the  asynchronous 

anaphase-telophase  figures  and  the  asynchronous  telophase-interphase 
figures,  were  frequently  observed  in  various  Merlin  mutants,  Mer^ 
(Gln324^stop)  ,  Mer^  ,  Mer^  ,  and  Mer^  (Gln^^o^stop^  _ 

The  asynchronous  mitosis  exit  phenotype  in  the  Mer'^  genetic  background 
could  be  corrected  by  the  introduction  of  a  wild-type  Merlin 
transgene  was  substantially  diminished.  These  results  suggest  that 
Merlin  is  important  for  the  control  of  mitosis  exit 

(2)  Examination  of  the  effect  of  Merlin  mutation  on  cell 
proliferation  in  different  regions  of  the  wing  imaginal  disc  reveals 
that  cells  lacking  Merlin  possess  greater  ability  to  overcome  vein 
restriction.  The  results  further  support  the  role  of  Merlin  in  cell 
motility,  cell  adhesion  and  cell  proliferation. 
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(3)  We  showed  that  the  Merlin  and  Wg  proteins  were  colocalized  in 
the  cells  at  the  D/V  compartment  border  of  the  wing  imaginal  disc. 
Merlin  inactivation  may  lead  to  an  alteration  on  the 
determination/maintenance  of  Wg  stripe  expression. 

(4)  We  have  extended  our  analysis  on  the  potential  genetic 
interactions  between  Merlin  and  the  proteins  involved  in  vesicular 
trafficking.  In  addition  to  the  potential  genetic  interaction  between 
the  Merlin  and  porcupine  genes  and  between  the  Merlin  and  shibire 
genes,  an  interaction  between  the  Merlin  and  lap  (like-AP180) ,  which 
encodes  a  clathrin  adaptor  protein  important  for  clathrin-mediated 
endocytosis  of  synaptic  vesicles,  was  identified.  Our  results 
indicate  that  Merlin  counteracts  with  Lap  and  suggest  that  through 
Lap,  Merlin  may  regulate  the  EGFR  pathway  required  for  vein  fate 
determination  in  the  wing. 

(5)  To  better  understand  merlin  function  across  different  species, 
we  conducted  a  study  on  the  evolution,  diversity,  and  overall 
distribution  of  merlin  among  different  taxa  (Golovnina  et  al . ,  2005) . 
We  found  a  monophyletic  origin  of  the  merlin  proteins  with  their  root 
in  the  early  metazoa.  Also,  we  compared  the  overall  similarity  among 
the  primary  and  secondary  structures  of  all  merlin  proteins  and 
identified  several  conserved  and  functionally  important  residues. 

Our  results  suggest  a  universal  role  for  merlin  in  a  wide  range  of 
metazoa . 


REPORTABLE  OUTCOMES: 

One  research  abstract  was  presented  to  the  2006  CTF  International 
Consortium  for  the  Molecular  Biology  of  NFl,  NF2,  and 
Schwannomatosis .  Also,  one  research  paper  was  published  in  BMC 
Evolutionary  Biology . 

Abstract 

(1)  Omelyanchuk,  L.V.,  Dorogova,  N.V.,  Kopyl,  S.,  Akhmameteva,  E.M., 
Fehon,  R.G.,  and  Chang,  L.-S.  2006.  The  Role  of  Drosophila  Merlin 
in  Spermatogenesis  and  Wg  Morphogen  Trafficking  in  the  Imaginal 
Disc.  Abstract  presented  to  the  2006  CTF  International  Consortium 
for  the  Molecular  Biology  of  NFl,  NF2,  and  Schwannomatosis. 

Previously  we  reported  that  the  viable,  but  sterile  Merlin 
mutant  Mer^  (Met^^^^Ile)  displayed  abnormalities  in  cyst 
polarization  during  spermatogenesis.  The  Merlin  protein  normally 
localized  in  the  acrosome  of  mature  sperm,  but  this  localization 
was  altered  in  the  Mer^  mutant.  We  now  showed  that  a  more  severe 
defect  in  cyst  polarization  could  be  seen  in  the  adult  male 
pharates  carrying  the  Mer^  allele  (Gln^^°-^stop)  .  We  also  found 
that  both  the  clathrin  mutant  Chc4 ,  known  to  have  abnormal  cyst 
individualization,  and  the  meiotic  mutant  ffl6  displayed  defective 
nuclei  polarization  and  nuclear  shaping.  Because  Merlin  has  been 
shown  to  associate  with  endocytic  compartments  and  because 
mutations  in  the  genes,  such  as  clathrin  and  ffl6,  that  are  known 
to  be  important  for  vesicle  formation  and  cytokinesis,  also  affect 
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nuclei  polarization,  we  examined  whether  Merlin  is  involved  in  the 
vesicular  traffic  in  somatic  tissues.  We  first  examined  potential 
interaction  between  Merlin  and  shihere ,  a  dynamin  participating  in 
various  microtubule-mediated  processes  such  as  cytokinesis  and 
endocytosis.  Ectopic  expression  of  a  dominant-negative  mutant  of 
shihere  {shi'^^)  ,  by  the  1096  wing  pouch  driver  led  to  a 

disrupted  wing  morphology  including  the  loss  of  MTR  staut 
bristles.  The  shiDN  wing  margin  phenotype  was  rescued  by 
simultaneous  introduction  of  a  UAS-Mer  construct  carrying  the  Mer^ 
or  Mer^~^°°  transgene.  Partial  restoration  of  the  phenotype  was 
also  seen  when  truncated  Merlin  expression  constructs  Mer^~^^°  and 
Mer^~^'^^  were  used,  while  no  restoration  was  found  with  Mer^~^^^  and 
Mer^^^ .  These  results  suggest  that  Merlin  plays  a  role  in  the 
vesicular  traffic  and  the  FERM  domain  including  the  Blue  Box  is 
required  for  the  interaction  with  Shibere.  The  MTR  staut  bristles 
are  derived  from  the  cells  normally  expressing  wingless  (Wg) 
morphogen  whose  movement  through  the  tissue  is  related  to  the 
vesicular  traffic.  We  found  that  the  expression  pattern  of  patched 
(ptc) ,  which  marks  the  A/P  compartment  border  in  the  wing  imaginal 
disc,  was  not  changed  in  the  Mer^  mutant,  suggesting  that  the 
Decapentaplegic  (Dpp)  morphogen  trafficking  is  likely  not  affected 
by  Merlin  mutation.  In  contrast,  the  stripe  expression  pattern  of 
Wg  at  the  D/V  compartment  border  was  altered  in  the  Mer'^  mutant. 
Intriguingly,  wg-lacZ  insertion  in  the  Mer^  background  revealed  no 
change  in  the  wg  regulatory  zone  as  compared  with  that  in  the 
wild-type  control.  While  the  expression  of  neuralized,  which 
participates  in  the  determination  of  dTR  and  vTR  bristles,  was 
controlled  by  Wg,  its  expression  pattern  was  significantly 
deviated  in  the  Mer^  mutant.  In  addition,  the  Wg-regulated  cycE 
expression  pattern  at  the  D/V  border  was  also  affected  by  the  Mer^ 
mutation.  These  results  suggest  that  Merlin  plays  an  important 
role  in  Wg  trafficking.  It  has  been  shown  that  Porcupine  (Pore) 
facilitates  Wg  glycosylation  in  the  endoplasmic  reticulum  and  the 
wing  margin  is  subjected  to  Wg  regulation.  We  found  that  both  Mer^ 
and  Mer^  mutations  did  not  significantly  affect  the  wing  margin 
morphology  and  only  caused  additional  sensory  bristles  within  the 
row  of  staut  bristles.  While  overexpression  of  Mer^  or  Mer^^^  did 
not  change  the  wing  margin  pattern,  overexpression  of  Pore  led  to 
complete  disappearance  of  Staut  bristles  and  irregularities  of 
sensory  and  mechano-sensory  bristles.  In  contrast,  simultaneous 
overexpression  of  Mer  and  pore  restored  the  staut  bristles  and 
normalized  the  arrangement  of  mechano-  and  chemo-sensory  bristles. 
In  addition,  while  overexpression  of  Mer^™  did  not  affect  Wg 
transcription  and  protein  expression  in  the  wing  imaginal  disc, 
pore  overexpression  results  in  complete  absence  of  the  Wg  protein 
at  the  D/V  border.  Importantly,  overexpression  of  both  pore  and 
Mer^  or  Mer^^^  restored  the  Wg  stripe  at  the  D/V  border.  These 
results  suggest  that  overexpression  of  Merlin  may  facilitates  Wg 
secretion.  Together,  our  data  support  the  notion  that  Merlin 
participates  in  the  vesicular  traffic. 

Publication 

(1)  Golovnina,  K. ,  Blinov,  A.,  Akhmametyeva ,  E.M.,  Omelyanchuk,  L.V., 
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and  Chang,  L.-S.  2005.  Evolution  and  Origin  of  Merlin,  the  Product 
of  the  Neurofibromatosis  Type  2  Tumor-Suppressor  Gene.  BMC 
Evolutionary  Biology  5:69-86. 

In  this  paper,  we  examined  the  evolution ,  diversity ,  and 
overall  distribution  of  merlin  among  different  taxa .  By  combining 
bioin formatic  and  phylogenetic  approaches ,  we  demonstrate  that 
merlin  homologs  are  present  across  a  wide  range  of  metazoan 
lineages .  While  the  phylogenetic  tree  shows  a  monophyletic  origin 
of  the  ERM  family,  the  origin  of  the  merlin  proteins  is  robustly 
separated  from  that  of  the  ERM  proteins .  The  derivation  of  merlin 
is  thought  to  be  in  early  metazoa .  We  have  also  observed  the 
expansion  of  the  ERM-like  proteins  within  the  vertebrate  clade, 
which  occurred  after  its  separation  from  Urochordata  (Ciona 
intestinalis ) .  Amino  acid  sequence  alignment  reveals  the  absence 
of  an  actin-binding  site  in  the  C-terminal  region  of  all  merlin 
proteins  from  various  species  but  the  presence  of  a  conserved 
internal  binding  site  in  the  N-terminal  domain  of  the  merlin  and 
ERM  proteins .  In  addition ,  a  more  conserved  pattern  of  amino  acid 
residues  is  found  in  the  region  containing  the  so-called  "Blue 
Box,"  although  some  amino  acid  substitutions  in  this  region  exist 
in  the  merlin  sequences  of  worms,  fish,  and  Ciona.  Examination  of 
sequence  variability  at  functionally  significant  sites,  including 
the  serine-518  residue ,  the  phosphorylation  of  which  modulates 
merlin's  intra-molecular  association  and  function  as  a  tumor 
suppressor ,  identifies  several  potentially  important  sites  that 
are  conserved  among  all  merlin  proteins  but  divergent  in  the  ERM 
proteins .  Secondary  structure  prediction  reveals  the  presence  of 
a  conserved  a-helical  domain  in  the  central  to  C-terminal  region 
of  the  merlin  proteins  of  various  species .  The  conserved  residues 
and  structures  identified  correspond  to  the  important  sites 
highlighted  by  the  available  crystal  structures  of  the  merlin  and 
ERM  proteins .  Furthermore,  analysis  of  the  merlin  gene  structures 
from  various  organisms  reveals  the  increase  of  gene  length  during 
evolution  due  to  the  expansion  of  introns ;  however ,  a  reduction  of 
intron  number  and  length  appears  to  occur  in  the  merlin  gene  of 
the  insect  group.  Our  results  demonstrate  a  monophyletic  origin  of 
the  merlin  proteins  with  their  root  in  the  early  metazoa .  The 
overall  similarity  among  the  primary  and  secondary  structures  of 
all  merlin  proteins  and  the  conservation  of  several  functionally 
important  residues  suggest  a  universal  role  for  merlin  in  a  wide 
range  of  metazoa . 


CONCLUSIONS : 

The  role  of  Merlin  in  the  control  of  mitosis  exit  has  been 
confirmed  by  the  use  of  various  Merlin  mutants.  Merlin  mutations  lead 
to  two  types  of  mitosis  exit  asynchrony,  the  asynchronous  anaphase- 
telophase  figures  and  the  asynchronous  telophase-interphase  figures. 
Cells  lacking  Merlin  also  possess  greater  ability  to  overcome  vein 
restriction.  The  Merlin  protein  is  colocalized  with  the  Wingless 
morphogen  in  the  cells  at  the  dorsal/ventral  compartment  border  of 
the  wing  imaginal  disc.  Merlin  inactivation  may  lead  to  an 
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alteration  on  the  determination/maintenance  of  Wg  stripe  expression. 
In  addition  to  the  potential  genetic  interaction  between  the  Merlin 
and  porcupine  genes  and  between  the  Merlin  and  shihire  genes,  an 
interaction  between  the  Merlin  and  lap  (like-AP180) ,  which  encodes  a 
clathrin  adaptor  protein  important  for  clathrin-mediated  endocytosis 
of  synaptic  vesicles,  was  identified.  Our  results  indicate  that 
Merlin  counteracts  with  Lap  and  through  Lap,  Merlin  may  regulate  the 
EGFR  pathway  required  for  vein  fate  determination  in  the  wing.  By 
analyzing  the  evolution,  diversity,  and  overall  distribution  of 
merlin  among  different  taxa,  we  demonstrate  a  monophyletic  origin  of 
the  merlin  proteins  with  their  root  in  the  early  metazoa.  The  overall 
similarity  among  the  primary  and  secondary  structures  of  all  merlin 
proteins  and  the  conservation  of  several  functionally  important 
residues  suggest  a  universal  role  for  merlin  in  a  wide  range  of 
metazoa . 
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ABSTRACT 


Presently,  the  mechanism  by  which  Merlin  functions  as  a  tumor 
suppressor  is  not  understood.  By  utilizing  Drosophila  genetics,  we 
have  found  a  role  of  Merlin  in  the  control  of  mitosis  exit.  Merlin 
mutations  lead  to  two  types  of  mitosis  exit  asynchrony,  the 
asynchronous  anaphase-telophase  figures  and  the  asynchronous 
telophase-interphase  figures.  Also,  we  show  that  cells  lacking 
Merlin  possess  greater  ability  to  overcome  vein  restriction  in  the 
wing.  The  Merlin  protein  is  colocalized  with  the  Wingless  morphogen 
in  the  cells  at  the  dorsal/ventral  compartment  border  of  the  wing 
imaginal  disc.  Merlin  inactivation  may  lead  to  an  alteration  on  the 
determination/maintenance  of  Wg  stripe  expression.  We  have  found 
potential  genetic  interactions  between  the  Merlin  and  porcupine  genes 
and  between  the  Merlin  and  shibire  genes.  We  also  discover  an 
interaction  between  the  Merlin  and  lap  (like-AP180) ,  which  is 
important  for  clathrin-mediated  endocytosis  of  synaptic  vesicles,  was 
identified.  Our  results  suggest  that  Merlin  counteracts  with  Lap  and 
through  Lap,  Merlin  may  regulate  the  EGFR  pathway  required  for  vein 
fate  determination  in  the  wing.  In  addition,  by  analyzing  the 
evolution,  diversity,  and  overall  distribution  of  merlin  among 
different  taxa,  we  demonstrate  a  monophyletic  origin  of  the  merlin 
proteins  with  their  root  in  the  early  metazoa.  The  overall  similarity 
among  the  primary  and  secondary  structures  of  all  merlin  proteins  and 
the  conservation  of  several  functionally  important  residues  suggest  a 
universal  role  for  merlin  in  a  wide  range  of  metazoa. 
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Abstract 

Background:  Merlin,  the  product  of  the  Neurofibromatosis  type  2  (NF2)  tumor  suppressor  gene,  belongs  to  the  ezrin- 
radixin-moesin  (ERM)  subgroup  of  the  protein  4. 1  superfamily,  which  links  cell  surface  glycoproteins  to  the  actin 
cytoskeleton.  While  merlin's  functional  activity  has  been  examined  in  mammalian  and  Drosophila  models,  little  is 
understood  about  its  evolution,  diversity,  and  overall  distribution  among  different  taxa. 

Results:  By  combining  bioinformatic  and  phylogenetic  approaches,  we  demonstrate  that  merlin  homologs  are  present 
across  a  wide  range  of  metazoan  lineages.  While  the  phylogenetic  tree  shows  a  monophyletic  origin  of  the  ERM  family, 
the  origin  of  the  merlin  proteins  is  robustly  separated  from  that  of  the  ERM  proteins.  The  derivation  of  merlin  is  thought 
to  be  in  early  metazoa.  We  have  also  observed  the  expansion  of  the  ERM-like  proteins  within  the  vertebrate  clade,  which 
occurred  after  its  separation  from  Urochordata  (dona  intestinalis).  Amino  acid  sequence  alignment  reveals  the  absence 
of  an  actin-binding  site  in  the  C-terminal  region  of  all  merlin  proteins  from  various  species  but  the  presence  of  a 
conserved  internal  binding  site  in  the  N-terminal  domain  of  the  merlin  and  ERM  proteins.  In  addition,  a  more  conserved 
pattern  of  amino  acid  residues  is  found  in  the  region  containing  the  so-called  "Blue  Box,"  although  some  amino  acid 
substitutions  in  this  region  exist  in  the  merlin  sequences  of  worms,  fish,  and  Ciona.  Examination  of  sequence  variability 
at  functionally  significant  sites,  including  the  serine-518  residue,  the  phosphorylation  of  which  modulates  merlin's  intra¬ 
molecular  association  and  function  as  a  tumor  suppressor,  identifies  several  potentially  important  sites  that  are  conserved 
among  all  merlin  proteins  but  divergent  in  the  ERM  proteins.  Secondary  structure  prediction  reveals  the  presence  of  a 
conserved  a-helical  domain  in  the  central  to  C-terminal  region  of  the  merlin  proteins  of  various  species.  The  conserved 
residues  and  structures  identified  correspond  to  the  important  sites  highlighted  by  the  available  crystal  structures  of  the 
merlin  and  ERM  proteins.  Furthermore,  analysis  of  the  merlin  gene  structures  from  various  organisms  reveals  the 
increase  of  gene  length  during  evolution  due  to  the  expansion  of  introns;  however,  a  reduction  of  intron  number  and 
length  appears  to  occur  in  the  merlin  gene  of  the  insect  group. 

Conclusion:  Our  results  demonstrate  a  monophyletic  origin  of  the  merlin  proteins  with  their  root  in  the  early  metazoa. 
The  overall  similarity  among  the  primary  and  secondary  structures  of  all  merlin  proteins  and  the  conservation  of  several 
functionally  important  residues  suggest  a  universal  role  for  merlin  in  a  wide  range  of  metazoa. 
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Background 

The  advancement  in  genome  sequencing  projects,  the 
accumulation  of  knowledge  in  hioinformatics,  and  the 
molecular  genetic  analysis  of  genes  and  their  functions  in 
a  variety  of  model  organisms  provides  us  with  an  unprec¬ 
edented  opportunity  to  identify  novel  genes  based  on 
sequences  related  to  characterized  genes  [1].  This  process 
is  conducted  using  pairwise  sequence  comparison  with 
the  understanding  that  genes  form  families  wherein 
related  sequences  likely  share  similar  functions.  Although 
initial  identification  of  new  genes  may  not  yield  a  clear 
indication  of  their  respective  functions,  studies  on  their 
evolution  may  allow  validation  of  their  sequence  identity 
and  provide  information  on  their  putative  functional 
characteristics.  For  genes  evolved  from  duplication  and/or 
adapted  to  different  evolutionary  niches  during  specia- 
tion,  detailed  sequence  comparison  can  provide  addi¬ 
tional  information  regarding  their  biological  and 
biochemical  characteristics  [2]. 

Neurofibromatosis  type  2  (NF2)  is  a  highly  penetrant, 
autosomal  dominant  disorder,  whose  hallmark  is  the 
development  of  bilateral  vestibular  schwannomas  [3,4]. 
The  tumor  suppressor  gene  associated  with  NF2  has  been 
identified  and  termed  the  neurofibromatosis  type  2  gene 
(NF2)  [5,6].  The  NF2  gene  encodes  a  protein  named  mer¬ 
lin,  for  moesin-ezrin-radixin  like  protein,  or  schwan- 
nomin,  a  word  derived  from  schwannoma,  the  most 
prevalent  tumor  seen  in  NF2.  For  simplicity,  we  refer  to 
the  NF2  gene  product  as  merlin  hereafter. 

Merlin  shares  sequence  similarity  with  the  ezrin,  radixin, 
and  moesin  (ERM)  proteins,  which  belong  to  the  protein 
4.1  superfamily  of  cytoskeleton-associated  proteins  that 
link  cell  surface  glycoproteins  to  the  actin  cytoskeleton 
[7,8].  Like  ERM  proteins,  merlin  consists  of  three  pre¬ 
dicted  structural  domains  [5,6,9].  The  N-terminal 
domain,  termed  the  FERM  (F  for  4.1)  domain,  is  highly 
conserved  among  all  members  of  the  ERM  family  and  is 
important  for  interactions  with  cell  surface  glycoproteins, 
including  CD44  and  intercellular  adhesion  molecules 
[10-13].  Crystal  stmeture  analysis  shows  that  the  tertiary 
structure  of  the  FERM  domain  of  merlin  closely  resembles 
those  of  the  FERM  domain  of  moesin  and  radixin  [14-18]. 
The  FERM  domain  of  merlin  exists  as  a  clover-shaped 
molecule  consisting  of  three  structural  subdomains  A,  B, 
and  C,  which  are  homologous  to  lobes  FI,  F2,  and  F3  in 
moesin  and  radixin.  Subdomain  A,  composed  of  residues 
20-100,  possesses  a  ubiquitin-like  fold.  Subdomain  B, 
consisting  of  residues  101-215,  folds  itself  into  a  topol¬ 
ogy  like  that  of  the  acetyl-CoA-binding  protein.  Sub- 
domain  C,  containing  residues  216-313,  adopts  the 
pleckstrin  homology/phosphotyrosine-binding  fold 
found  in  a  broad  range  of  signaling  molecules  [14-16]. 
The  second  half  of  merlin  contains  a  predicted  a-helix 


domain,  which  is  also  present  in  the  ERM  proteins  [19]. 
Although  the  unique  C-terminus  of  merlin  lacks  the  con¬ 
ventional  actin-binding  domain  found  in  the  ERM  pro¬ 
teins  [20,21],  merlin  can  directly  bind  actin  using  the 
residues  at  the  N-terminal  domain  and  indirectly  through 
its  association  with  [311-spectrin  or  fodrin  [22-24]. 

The  merlin  and  ERM  proteins  are  thought  to  be  key  regu¬ 
lators  of  interactions  between  the  actin  cytoskeleton  and 
the  plasma  membrane  in  polarized  cells.  They  act  as 
important  members  of  signal  transduction  pathways  that 
control  cell  growth  and  participate  in  the  sorting  of  mem¬ 
brane  proteins  during  exocjTic  traffic  [25,26].  Flowever, 
unlike  the  ERM  proteins,  merlin  has  a  distinct  function  as 
a  tumor  suppressor  [27].  Growth  suppression  by  merlin  is 
dependent  on  its  ability  to  form  intramolecular  associa¬ 
tions  [28,29].  In  this  regard,  merlin  exists  in  an  "open" 
(inactive  form)  or  "closed"  (active  growth-suppressive 
form)  conformation  that  is  regulated  by  phosphorylation 
[30-35]. 

While  previous  studies  have  focused  primarily  on  the 
functional  analysis  of  merlin,  limited  information  is  avail¬ 
able  about  its  overall  distribution  across  eukaryotes  and 
its  evolution.  A  phylogenetic  study  indicates  that  the 
FERM  domains  of  ERM  homologs  from  sea  urchins, 
Caenorhabditis  elegans,  Drosophila  melanogaster,  and  verte¬ 
brates  share  74-82%  amino  acid  identity  and  have  about 
60%  identity  with  those  of  merlin  [25,36-42].  These  levels 
of  identity  are  exceptionally  high,  suggesting  that  the  pro¬ 
tein  structures  of  the  merlin  and  ERM  proteins  from  these 
species  may  be  well-conserved.  The  most  divergent  ERM 
proteins  are  found  in  tapeworms  and  schistosomes  [36- 
39].  The  FERM  domains  of  these  parasite  proteins  share 
only  44-58%  similarity  to  their  vertebrate  homologs.  The 
high  degree  of  structural  conservation  among  these  pro¬ 
teins  points  to  possible  similarities  or  functional  redun¬ 
dancies.  Intriguingly,  no  FERM  domain-encoding  genes 
have  been  identified  in  the  genome  of  the  yeast  Saccharo- 
myces  cerevisiae,  implying  that  FERM  domains  evolved  in 
response  to  multicellularity,  rather  than  as  a  cytoskeletal 
component  [25]. 

The  goal  of  the  present  study  was  to  expand  our  under¬ 
standing  of  the  taxonomic  diversity  of  merlin  and  the  phy¬ 
logenetic  relationships  using  experimentally  annotated 
and  predicted  sequences.  By  the  integration  of  the  BLAST- 
based  analysis  using  the  available  partial  and  whole 
genome  sequences  with  phylogeny  reconstmetion,  we 
have  generated  an  evolutionary  tree  for  the  entire  ERM- 
family  members  from  various  taxa  and  identified  some 
interesting  details  about  their  phylogenetic  origin.  In 
addition,  we  compared  sequence  variability  at  function¬ 
ally  significant  sites,  including  the  major  phosphorylation 
site  of  merlin,  predicted  the  secondary  structure  of  the 
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Table  I:  The  list  of  the  predicted  and  experimentally  annotated  merlin  and  ERM  proteins  included  in  this  study. 


Species 

Proteins 

UniProtKB/Swiss- 

GenBank 

Entries  from  Genome 

Related  Resources 

Prot  Identifiers 

Accession  No. 

Sequencing  Projects 

Homo  sapiens 

merlin  (NF2) 

P35240 

AAA362I2 

http://www.ncbi.nlm.nih.fOv/entrez/ 

auerv.fc?i?db=?md=Retrieve&doDt=Overview&list  uids=95S8 

ezrin 

PI53I  1 

CAA35893 

radixin 

P3524I 

AAA3654I 

moesin 

P26038 

AAA36322 

Pan  troglodytes 

similar  to  NF2 

XP  51 5061 

httD://www.h?sc.bcm.tmc.edu/Droiects/chimDanzee/ 

Papio  anubis 

merlin 

PS9750 

AA023 1 33 

httD://www.ncbi.nlm. nih.gov/entrez/ 

querv.fcfi?db=fmd=Retrieve&dopt=Overview&list  uids=  1 2965 

Bos  taurus 

ezrin 

P3I976 

AAA305I0 

httD://www.hgsc.bcm.tmc.edu/Droiects/bovine/ 

Sus  scrofa 

radixin 

P26044 

AAB02865 

http://www.tif r.orf/tifr-scripts/tfi/T  index.cfiispec 

moesin 

P26042 

AAB02864 

Canis  familiaris 

similar  to  NF2 

XP  534729 

httD://www.tigr.org/tigr-scriDts/tgi/T  index.cgitsoec 

Oryctolagus  cuniculus 

ezrin 

Q8HZQ5 

AAN068I8 

http://www.ncbi.nlm.nih.fOv/entrez/ 

auerv.fcgi?db=gmd=Retrieve&doDt=Overview&list  uids=  12818 

Mus  musculus 

ezrin 

P26040 

CAA43086 

http://www.tigr.Org/tifr-scripts/tgi/T  index.cgiispec 

radixin 

P26043 

CAA43087 

merlin 

P46662 

CAA52737 

Rattus  norvegicus 

ezrin 

P3I977 

AAR9I694 

http://www.tigr.Org/tifr-scripts/tgi/T  index.cgiispec 

NF2 

XP  341249 

Callus  gallus 

ezrin 

Q9YGW6 

BAA75497 

http://www.tif r.orf/tifr-scripts/tfi/T  index.cfiispec 

radixin 

Q9PU4S 

CAB59977 

merlin 

NP  989828 

Xenopus  laevis 

unknown 

AAH77822 

http://vvww.xenbase.orf/ 

protein 

Danio  rerio 

nf2a 

Q6Q4I3 

AAS66973 

httD://vvww.ensembl.org/Danio  rerio/ 

moesin 

QS03E6 

AAH95359 

Fugu  rubripes 

radixin 

FRUPOOOOO 132603 

httD://genome.igi-Dsf.org/ 

moesin 

FRUP00000I563I3 

merlin 

FRUPOOOOO  136298 

Tetraodon  nigroviridis 

unnamed 

CAG08868 

httD://www.ensembl.org/Tetraodon  nigroviridis/ 

protein  1 

CAG08250 

unnamed 
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Table  I:  The  list  of  the  predicted  and  experimentally  annotated  merlin  and  ERM  proteins  included  in  this  study.  (Continued) 


protein  2 

Ciona  intestinalis 

erm-like 

merlin-like 

Ciona  savignyi 

merlin-like 

Biomphalaria  glabrata 

erm-like 

Lytechinus  variegates 

moesin 

Apis  mellifera 

similar  to 
schwannomin 

Drosophila  melanogaster 

merlin 

moesin 

Drosophila  yakuba 

merlin-like 

Anopheles  gambiae 

merlin-like 

fragment 

Caenorhabditis  elegans 

ermla 

erm  1  b 

nfm  la 

nfm  lb 

Caenorhabditis  briggsae 

erm-like 

nfm  1 

Caenorhabditis  remanie 

merlin-like 

erm-like 

Brugia  malayi 

merlin-like 

Schistosoma  japonicum 

JF2 

Taenia  saginata 

myosin-like 

Echinococcus  multilocularis 

EMIO 

Echinococcus  granulosus 

EGIO 

Phanerochaete  chrysosporium 

— 

Aspergillus  flavus 

— 

Arabidopsis  thaliana 

— 

Oryza  sativa 

— 

Trypanosoma  brucei 

— 

Cryptosporidium  parvum 

... 

PS2962 

AAK6I3S3 

AAC465I4 

ciO  100 149701 

ciO  100 130636 

paired_scaffold_  1 09 

Q24564 

XP  392673 

AAB08449 

P46I50 

AAB48934 

P9I0IS 

EAA07087 

AAB37643 

predicted  in  this  work 

P9I0I6 

AAB37642 

Q20307 

AAA  19073 

Q9SQG5 

AAK68385 

Q948I5 

AAB49033 

CAA65728 

BP:CBP03I33 

BP:CBP0S02S 

predicted  in  this  work 

3 1 6.m00022 

Q24796 

A45620 

CAA8262S 

http://genome.|^i-psf.org/ 

http://www.broad.mit.edu/ftp/ 
http://bioloifv.unm.edu/biomphalaria-genome/ 
http://www.hgsc.bcm.tmc.edu/project.'5/.';eaurchin/ 
http://racerxOO.tamu.edu/PHP/bee  search.php 

http://fbserver.gen.cam.ac.uk:708 1  / 

http://genome.wiistl.edu/blast/client.pl 
http://www.tigr.Org/tigr-scripts/tgi/T  index.cgitspec 

http://www.wormbase.org/ 


http://www.wormbase.org/ 

http://genome.wustl.edu/blast/client.pl 

http://www.tigr.org/tdb/e2k  I  /bma  I  / 
http://www.nhm.ac.uk/hosted  sites/schisto/ 
http://www.ncbi.nlm. nih.gov/entrez/query  .fcgiiCMD=protein 
http://www.sanger.ac.uk/Projects/Echinococcus/ 

http://genome.jgi-psf.org/whiterot  I  /whiterot  I  .home 
http://www.tigr.org/tigr-scripts/tgi/T  index.cgiispec 
http://www.tigr.Org/tigr-scripts/tgi/T  index.cgiispec 
http://www.tigr.Org/tigr-scripts/tgi/T  index.cgiispec 
http://www.tigr.org/tigr-scripts/tgi/T  index.cgiispec 
http://www.tigr.Org/tigr-scripts/tgi/T  index.cgi?species=c  parvum 
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100 


t'adixiii  Homo  sapietts 
['adudn  Stts  scrqfh 
radixui  Mus  mttscuitts 
radixiii  Galhis  galltis 

unnamed  protein  2  TetraoAon  nigroviridis 
i'adixin  Fugtt  mbripes 
^  QQ  ,  moesin  Homo  sapietts 
^  moesin  Sns  scrqfii 

I - moesin  Fttgu  mbripes 

99 1 - similai'  to  moesin  Danio  rerio 

I - eaiii  Gaiius  galhts 

ealii  Bos  /aunts 
caIn  Otyctoiagns  cnnicuUis 


100  ^ 
99 


jgip  eain  Homo  sapiens 
gQi  r  ealii  Mus  muscuhis 
1  ooL  eain Rattus uomegicus 
-  eiin-like  Ciona  in/estiutUis 


- enn-like  BiomphaJaria  gtabrata 

- moesin  Lytechinus  variegates 

- moesin  Drosophila  melatiogaster 

I  eiin-like  Caenorhabditis  remauie 
P  eiin-like  Caenorhabditis  briggsae 
iWil  Caenorhabditis  elegans 

821  eiinlb  Caenortmbditis  elegans 

- Jr2  Schistosoma  japonicnm 

I - myosin-like  Taenia  saginata 

I  EMIO  Echinococctts  multilocularis 
100 1  EGIO  Echinococctts  granulostts  - 

100 1  merlin-like  la  Caenorhabditis  elegans 
L  merlin-like  lb  Caenorltabditis  elegans 
mei'lin-like  Caenorhabditis  briggsae 
merlin-like  Caenorhabditis  remanie 


100 


merlin-like  Bntgia  niatt^'i 

inei'Un-like  (fragment)  Anopheles  gatnbiae 
similai'  to  scliwaiuioinin  Apis  mellij^m 
P  merlin  Drosophila  melanogaster 

- merlin-like  Drosophila  yakitba 

TOO  I - merlin-like  Ciona  intestinalis 

t - merlin-like  Ciona  savignyi 

nf2a  Danio  rerio 


unnamed  protein  1  Tetraodon  nigroviridis 
merlin  Fttgu  mbripes 
-  muiknown  protein  Xenoptts  laevis 
inerlin-like  Galltts  galitts 
P  similai'  to  nfZ  Canis JitmiUaris 
merlin  Ratbts  norvegietts 
merlin  Mus  mttscuitts 
merlin  Papio  anttbis 
merlin  Homo  sapiens 

■19 1 - similai'  to  nt2  Pan  troglotfytes 

- band  4.1  Danio  rerio 


ERM  clade 


Merlin  clade 


-  band  4.1  Mtts  mttscuitts 
-  hmxtX  4.1  Homo  sapiens 


Figure  I 

The  neighbor-joining  tree  of  the  ERM  family.  The  diagram  illustrates  the  basic  resolution  of  the  ERM-family  members 
into  two  major  clades,  merlin  and  ERM.  Bootstrap  support  values  are  shown  above  each  node.  Shaded  boxes  denote  different 
subgroups  of  the  ERM  clade  in  vertebrates,  which  appeared  after  the  expansion  of  the  ERM-like  ancestor.  The  Tetraodon  nigro¬ 
viridis  "unnamed  protein  I  and  2"  sequences  (GenBank  Accession  No.  CAG08868  and  CAG08250.  respectively)  and  the  Xeno- 
pus  laevis  "unknown  protein"  sequence  (GenBank  Accession  No.  AAH77822)  were  grouped  based  on  their  similarity  to  the 
merlin  or  ERM  sequences. 
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merlin  proteins  of  various  species,  and  examined  the 
exon-intron  structural  evolution  of  the  NF2  gene. 

Results  and  Discussion 

BLAST  identification  of  merlin  sequences 

To  identify  putative  merlin  and  ERM  sequences  in  a  wide 
range  of  eukaryotes,  we  performed  BLAST  analysis  of  15 
available  genome  databases.  By  searching  through  all 
annotated  proteins  and  genome  sequences,  we  identified 
50  sequences  from  30  species.  Table  1  summarizes  the  full 
list  of  the  predicted  and  annotated  merlin  and  ERM  pro¬ 
teins  identified,  and  their  GenBank  and  available  Uni- 
ProtKB/Swiss-Prot  accession  numbers  and  related 
resources.  No  merlin-like  sequences  were  found  in  the 
genomes  of  fungi,  plants,  and  protozoa.  While  the 
sequencing  projects  of  the  hard  ticks  are  still  ongoing  at 
The  Institute  for  Genomic  Research  (TIGR),  amino  acid 
sequences  deduced  from  partial  cDNAs  of  salivary  glands, 
which  share  a  similarity  with  the  PERM  domain  of  merlin, 
have  been  noted  from  Rhipicephalus  appendiculatus  [43], 
Amblyomma  variegatum  [44],  and  Boophilus  microplus  [45]. 

Assembly  of  predicted  merlin  sequences  from  whole 
genome  shotgun 

To  date,  the  genomes  of  Caenorhabditis  remanei  and  Dro¬ 
sophila  yakuba  are  represented  by  a  set  of  contigs  [46]. 
When  contigs  are  ordered,  oriented,  and  positioned  with 
respect  to  each  other  by  mate-pair  reads,  they  are 
described  as  a  scaffold.  Scaffolds  are  the  main  product  of 
the  Whole  Genome  Shotgun  strategy  and  can  be  assigned 
to  chromosomes  using  chromosome-specific  markers. 
Although  the  extensive  scaffolds  for  the  genomes  of 
Caenorhabditis  remanei  and  Drosophila  yakuba  are  not  cur¬ 
rently  available,  we  were  able  to  assemble  predictive  pro¬ 
tein  sequences,  which  most  resemble  the  merlin  sequence 
of  the  closely-related  organism,  Caenorhabditis  elegans  or 
Drosophila  melanogaster,  respectively,  using  TBLASTN 
search  across  the  available  sets  of  contigs.  In  the  Drosophila 
yakuba  contig  49.37,  we  identified  a  predicted  merlin 
sequence,  which  is  nearly  identical  to  that  of  the  Dro¬ 
sophila  melanogaster  protein  with  the  exception  of  three 
positions  at  the  G-terminus,  two  substitutions  at 
Glu468^Asp  and  Asn^^s^Ser  and  an  insertion  of  Lys  at 
position  575.  Also,  we  found  three  Caenorhabditis  remanei 
contigs,  564.6,  2151.1,  and  2151.2,  which  contained  mer- 
lin-like  sequences  with  similarity,  ranging  from  81%  to 
100%,  to  its  Caenorhabditis  elegans  counterpart.  It  should 
be  noted  that  the  deduced  amino  acid  sequences  were 
assembled  manually,  and  in  some  cases,  only  partial  or 
approximate  amino  acid  sequences  could  be  obtained. 
Nevertheless,  they  were  useful  for  the  identification  of  the 
definite  gene  in  the  respective  genome  and  were  valuable 
for  the  following  phylogenetic  reconstruction  in  order  to 
validate  the  functional  relationship  and  evolution  of  the 
definite  gene. 


Construction  of  a  phylogenetic  tree  for  the  ERM  family  of 
proteins 

To  understand  the  origin  and  evolution  of  merlin,  we  con¬ 
ducted  a  phylogenetic  analysis  of  the  50  proteins  of  the 
ERM  family,  which  were  identified  from  30  different  taxa 
(Table  1)  using  the  neighbor- joining  method  [47,48] 
combined  with  the  molecular  evolutionary  genetics  anal¬ 
ysis  program  MEGA2  [49].  Three  protein  4.1  sequences 
from  humans,  mice,  and  zebrafish,  respectively,  were  used 
as  an  outgroup.  By  comparing  the  bootstrap  support  val¬ 
ues,  which  denote  the  number  of  times  a  grouping  occurs 
out  of  1,000  random  samples  from  the  alignment,  we 
constructed  a  phylogenetic  tree  for  the  ERM  family  of  pro¬ 
teins  (Figure  1).  Based  on  this  phylogenetic  analysis,  the 
entire  ERM  family  can  be  subdivided  into  the  ERM  clade 
and  the  merlin  clade.  While  both  clades  show  a  strongly 
supported  monophyletic  origin,  the  merlin  clade  can  be 
robustly  defined  and  separated  from  the  ERM  clade  (the 
bootstrap  support  value  =  100).  We  identified  a  total  of  22 
sequences  for  the  merlin  clade  and  28  sequences  for  the 
ERM  clade.  The  topology  of  the  phylogenetic  tree  within 
the  merlin  clade  appears  to  agree  with  the  general  concept 
of  evolutionary  history  of  speciation. 

The  merlin  clade  can  be  further  divided  into  three  groups 
according  to  the  order  of  derivation:  worms,  insects,  and 
Ghordata,  with  the  earliest  separated  genus,  Ciona,  in  the 
last  taxonomic  unit  (Figure  1).  The  predicted  merlin-like 
sequence  from  Caenorhabditis  remanei  branched  from  that 
of  Caenorhabditis  elegans,  and  similarly.  Drosophila  yakuba 
diverged  from  its  Drosophila  melanogaster  counterpart. 
Both  the  "unnamed  protein  1"  of  Tetraodon  nigroviridis 
and  the  "unknown  protein"  of  Xenopus  laevis  from  the 
GenBank  database  are  clustered  in  the  Ghordata  merlin¬ 
like  group  with  high  bootstrap  probabilities  (Figure  1), 
which  confirms  their  identity  as  merlin  homologs.  The 
protein  fragment  from  Anopheles  gambiae,  which  bears  a 
sequence  similarity  to  merlin,  is  grouped  together  with 
the  Apis  mellifera  merlin-like  protein  by  a  bootstrap  sup¬ 
port  value  of  100. 

Although  the  ERM-like  proteins  have  been  identified  in 
Taenia  saginata,  Schistosoma  japonicum,  Echinococcus  granu¬ 
losus,  and  Echinococcus  multilocularis  [36-39],  we  did  not 
find  any  merlin-like  sequences  in  the  genomes  of  these 
species.  The  lack  of  merlin-like  sequences  in  these  parasite 
genomes  may  be  due  to  incomplete  genome  sequences  in 
the  database;  however,  this  explanation  is  unlikely 
because  the  merlin-like  sequence  was  also  not  observed  in 
the  genome  of  Schistosoma  mansoni,  which  has  been  rigor¬ 
ously  studied  [50].  Another  possibility  is  that  the  absence 
of  merlin-like  sequences  in  these  organisms  may  reflect 
their  adaptation  to  a  parasitic  lifestyle  and  the  reduction 
of  various  organ  systems.  Alternatively,  the  merlin  protein 
may  emerge  later  during  evolution.  Similarly,  no  merlin- 
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546 

EIEALKLKERET - ALDILHNENSDRGG - S  SKHNT  I KKLTLQS  AKSRVAFFEEL - 595 

EIEALKLKERET - ALD  I LHNENSDRGGT- S  SKHNT  I  KKLTLQS  AKSRVAFFEEL - 722 

EIEALKLKERET - ALD  VLHSE  S  SDRGGP  -  S  SKHNT  I  KKLTLQS  AKSRVAFFEEL - 596 

EIEALKLKERET - ALD  I LHNENASRGN - SKHNT  I KKVSEGS S L YLA - 589 

EIESLKLKERES - AMDIMH - ENAG - SKQNT  I KKARRAVC I - 585 

EIESLKLEEQQQ - AGVYNLRSYAEPPFIPPSNRNSAYMAQMAFYEE - 585 

EIESLKLKERET - P  LD  1 1 HNQNTEQGT - SKQSNFKK - 536 

EIEVLKVDESMT - GFDQKQDS - NQ - PHTHE I  STFQGHKETPQ  Y  YDGL - 670 

EIEVLKVDENTG - PFNQKPDP - SQ - SVSHDATTFQSHNE - 627 

EIEQLKIGENQC - PLDDINAEQLRLGE - TKYSTLKKVKSGSTKARVAFFEEL - 416 

EIEVMKVGEKQC - ELDQLHEEQVRLGE - NKYSTLKKVKSGSTKARVAFFEEL - 637 

EIAPHKIEENQS - NLD  I LSEAQ I KAGE - NKYSTLKKLKSGSTKARVAFFEEL - 635 

DIDGLKRDGNVQ-NG - QHREHDAVHAQNVAHGFDKFTTMRMSMRGTPRQR — AQAFDGM -  654 

DIDGLKRDENMT - IQQHREHDAIHAQNVAQGFDKFTTMRMVRQG - 635 

EIESLKVVDRQS - EHDRIHAANLQMGI - DKYSTLR - 438 

ELSQARDENKR - THND  1 1 HNENMRQGR - DKYKTLRQIRQGNTKQRIDEFEAL - 586 

ELSQARDENKR - THND  1 1  HNENMRQGR - DKYKTLRQIRQGNTKQRIDEFEAM - 581 

ELAQARDEDKR - TQND  1 1 HSENVRQGR - DKYKTLRQIRQGNTKQRIDEFEAM - 585 

ELAQARDETKK - TQND  VLHAENVKAGR - DKYKTLRQIRQGNTKQRIDEFEAM - 583 

ELAQARDETKK - TQND  VLHAENVKAGR - GKYKTLRQIRQGNTKQRIDEFEAM - 583 

GLGSELGVGGS - SRRHQEDAERHAARR - ERQGRKRQVQNAASDPPGQHQAAHRRVR  600 

ELANARDESKK - TANDMLHAENMRLGR - DKYKTLRQIRQGNTKQRIDEFESM - 577 

ELANARDESKK - TVND  I LHAENVRAGR - DKYKTLRQIRSGNTKQRIDEFECM - 574 

QLSQLRDNNVTS - TQMD  I LHNENVKAGR - DKYKTLKQIRSGNTKHRIDEFECL - 609 

DLDAEKTKQNA - IDLLHQENMRQGR - DKYKTLKQIRQGNTKQRVDEFESM - 587 

ELQAMKDESKGE - DRYDKIHQENIRAGR - DKYQTLRNIRSGNTRQRIDTFENI - 572 

ELDSVKDQNAV - TD  YD  VLHMENKKAGR - DKYKTLRQIRGGNTKRRIDQYENM - 563 

ELDSVKDQNAV - TD  YD  VLHMENKKAGR - DKYKTLRQIRGGNTKRRIDQYENM - 584 

DLAQSRDETKET - ANDKIHRENVRQGR - DKYKTLREIRKGNTKRRVDQFENM - 578 

ELSSTRDPSKM - RDIDRHHEYNVREGN - DKYKTLRNIRKGNTMCRVEQFESM - 559 

ELSSTRDQSKM - RDIDRRHEYNVREGN - DKYKTLRNIRKGNTMCRVEQFESM - 559 

Actin-binding  site 


Figure  2 

Sequence  alignments  of  functionally  important  sites  in  the  merlin  and  ERM  proteins  of  various  species.  Com¬ 
parison  of  the  C-terminal  region  including  the  actin-binding  site  and  two  other  predicted  significant  residues.  Databank 
resources  for  the  ERM-family  proteins  listed  in  Table  I  were  used  in  the  analysis,  and  only  typical  representatives  from  each 
group  are  displayed. 


like  sequence  was  found  in  the  complete  genomes  of  pro¬ 
tozoa,  fungi,  and  plants.  Based  on  these  results,  we  sup¬ 
pose  that  the  derivation  of  merlin  occurred  in  the  early 
metazoa  after  its  separation  from  flatworms. 

As  illustrated  in  the  ERM  clade  in  Figure  1,  the  ERM-like 
proteins  found  in  parasites  can  be  grouped  together  but 
form  a  separate  branch  from  the  rest  of  ERM  proteins. 
Based  on  the  phylogenetic  analysis,  the  clustering  of  the 
"unnamed  protein  2"  of  Tetraodon  nigroviridis  with  the 
Fugu  rubripes  radixin  protein  defines  it  as  a  radixin-like 
protein.  It  should  be  noted  that  the  two  predicted  ERM 
proteins,  ermla  and  ermlb  of  Caenorhabditis  eiegans  [51], 
may  represent  different  transcript  variants  of  the  same 
gene  (also  see  below). 


Furthermore,  we  have  observed  the  evident  expansion  of 
the  ERM-like  ancestor  in  vertebrates  (Figure  1).  Since  the 
ERM  homolog  of  Ciona  emerged  prior  to  the  vertebrate 
clade,  it  appears  that  the  first  duplication  of  the  vertebrate 
ERM  sequence  occurred  after  its  divergence  from  Ciona. 
Subsequent  expansion  within  this  sub-family  has  led  to 
the  present  existence  of  three  related  groups  of  proteins, 
ezrin,  radixin  and  moesin;  among  which,  the  ezrin  group 
is  the  most  ancient.  Such  an  expanded  complement  may 
only  be  common  to  the  ERM  proteins  of  vertebrates 
because  other  metazoa  have  only  one  predicted  ERM-like 
homolog  [52-56].  Curiously,  the  increasing  number  of 
ERM  members  that  occurred  within  the  vertebrate  clade 
paralleled  the  evolutionary  complexity  of  the  organism.  It 
will  be  important  to  understand  how  these  proteins 


Page  7  of  1 8 

(page  number  not  for  citation  purposes) 


BMC  Evolutionary  Biology  2005,  5:69 


http://www.biomedcentral.eom/1471-2148/5/69 


H.  sapiens  merlin 

C.  familiaris  similar  to  nf2 
M.  musculus  merlin 

G.  gallus  merlin-like 

X  laevis  unknown  protein 

D.  rerio  nf2a 

F.  rubripes  merlin 

C.  intestinalis  merlin-like 

C.  savignyi  merlin-like 

A.  gambiae  fragment  of  merlin 

A.  mellifera  similar  to  schwannomin 

D.  melanogaster  merlin 
C.  elegans  nfm-la 

C.  briggsae  merlin-like 

B.  malayi  merlin-like 

H.  sapiens  ezrin 

B.  taurus  ezrin 

G.  gallus  ezrin 

H.  sapiens  radixin 

G.  gallus  radixin 

T.  nigroviridis  unnamed  protein  2 

H.  sapiens  moesin 
F.  rubripes  moesin 

C.  intestinalis  erm-like 

B.  glabrata  erm-like 
L.  variegates  moesin 

C.  elegans  erm-like  la 

C.  briggsae  erm-like 

D.  melanogaster  moesin 
T.  saginata  myosin-like 

E.  multilocularis  EMIO 


Blue  Box  204 

INLYQ-MTPEMWEERITAWYAEHRGRARDEAEMEYLK  209 
INLYQ-MTPEMWEERITAWYAEHRGRARDEAEMEYLK  318 
INLYQ-MTPEMWEERITAWYAEHRGRARDEAEMEYLK  209 
INLYQ-MTPEMWEERITAWYAEHRGRARDEAEMEYLK  209 
INLYQ-MTPEMWEERITAWYAEHRGRTRDEAEMEYLK  209 
LMQYQ-MTPDMWEEKITAWYAEHRNITRDEAEMEYLK  201 
INLYQ-MTAEMWEERITACYAEHRGRTRDEAEMEYLK  170 
RDQFQSVTGEMWETQITSWYAQHHGLTRDEAELEYLK  207 
IDQYQSVTGQMWEAQITPWYAGHHGLTRDEAELEYLK  179 
— QYQ-MTPQMWEERIKTWYADHRGMSRDEAEMEYLK  34 
IDQYQ-MTPEMWEDRIKIWYADHRGMSRDEAEMEYLK  201 
TDQYQ-MTPEMWEERIKTWYMDHEPMTRDEVEMEYLK  203 
IDQYD-MSADMWRDRIKRWWSRNAGQSREEAELEYLR  203 
IDQYD-MSADMWRDRIKRWWSRNAGQSREEAELEYLR  203 
IKQYD-MTPOMWEERIKRWWINNSGQSREDAEMEYLR  196 
MDQHK-LTRDQWEDRIQVWHAEHRGMLKDNAMLEYLK  193 
MDQHK-LTRDQWEDRIQVWHAEHRGMLKDSAMLEYLK  193 
MDQHK-LSRDQWEERIQVWHAEHSGMLKENAMLEYLK  193 
LEQHK-LTKEQWEERIQNWHEEHRGMLREDSMMEYLK  193 
LEQHK-LTKEQWEERIQNWHEEHRGMLREDSMMEYLK  193 
LEQHK-LTKEQWEERIQTWHEEHRSMLREDAMMEYLK  189 
LEQHK-LNKDQWEERIQVWHEEHRGMLREDAVLEYLK  193 
LDQHK-LNKDQWEERIQVWHEEHKGMMREESMMEYLK  189 
YEQHK-MTKEQWEERIQTWHCEHGSMTREDAMIEYLK  217 
YDQHK-LTKEQWEERIKSWYAEHKAMLREDAMIEYLK  194 
lEQHK-MTKEQWYERVSNWHQEHLSLSKEDAITEYMK  192 
LGQFK-LNSEEWERRIMTWWADHRATTREQAMLEYLK  194 
LGQFK-LNSEEWERRIMTWWADHRATTREQAMLEYLK  189 
IDQHK-MSKDEWEQSIMTWWQEHRSMLREDAMMEYLK  194 
KDQYD-QTDEQWFDRIVTYYKDHHDMSREDAMVQYLQ  195 
KEQYD-QTDEQWYERIIAYYKDHHDMSREDAMVQYLQ  195 


Figure  3 

Alignment  of  the  N-terminal  domain,  containing  the  Blue  Box  and  the  amino  acid  residue  204,  conserved  among  the  merlin 
proteins  but  divergent  in  the  ERM  proteins. 


evolved  and  how  their  functions  coordinated  because  of 
the  important  and  diverse  functions  of  ERM  proteins 
[8,25,26]. 

Evolution  of  the  functionally  important  residues  in  merlin 

Although  initial  identification  of  proteins  via  sequence 
similarities  does  not  yield  a  clear  indication  of  their 


respective  functions,  analysis  of  specific  conserved  regions 
and  residues  may  provide  important  information  regard¬ 
ing  their  putative  functional  characteristics.  We  conducted 
pairwise  sequence  comparison  among  all  obtained  merlin 
and  ERM  sequences,  and  identified  several  regions  of 
interest.  The  results  of  the  entire  sequence  alignment  are 
provided  in  the  Additional  Eile  1  and  are  summarized  in 
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Figure  4 

Sequence  alignments  reveal  conservation  of  several  functionally  important  residues,  including  the  major  phosphorylation  site  of 
the  merlin  group. 


Figures  2,  3,  and  4.  Previously,  the  conservation  of  the  N- 
terminal  PERM  domain  among  human  ERM  proteins  and 
their  functional  importance  were  described  [10-13].  In 
our  alignment,  we  showed  that  this  conservation 
extended  to  the  merlin  and  ERM  proteins  of  various  spe¬ 
cies  for  which  sequences  were  available  to  date.  These  data 
suggest  a  universal  role  for  the  presence  of  the  PERM 
domain  during  evolution  and  further  imply  an  existence 
of  certain  evolutionary  constraints  on  the  changes  of  their 
amino  acid  residues. 

Although  merlin  lacks  the  C-terminal  actin-binding  site 
found  in  ERM  proteins  [7,20,21,57],  it  can  directly  inter¬ 
act  with  the  actin  cytoskeleton  [22,58]  or  indirectly  bind 
via  the  actin-binding  protein  pil  spectrin/fodrin  [23,24]. 
Sequence  alignment  showed  extensive  amino  acid  varia¬ 
bility  in  the  C-terminal  region  of  the  merlin  proteins  of 
various  species,  while  a  noncontiguous  stretch  of  25 
amino  acid  residues,  including  the  well-defined  actin- 
binding  site,  was  reliably  aligned  among  all  predicted 
ERM  proteins  with  the  exception  of  the  "unnamed  protein 
2"  of  Tetraodon  nigroviridis  (Figure  2).  According  to  the 
phylogenetic  tree,  the  "unnamed  protein  2"  of  Tetraodon 
nigroviridis  is  classified  in  the  radtxin  group  (Figure  1),  and 
its  sequence  visibly  differs  from  other  radixin  proteins 
only  at  the  C-terminus.  The  reason  for  this  sequence  vari¬ 
ability  is  presently  unknown.  It  may  be  due  to  an  inaccu¬ 
racy  in  sequence  assembly  from  the  scaffold.  Alternatively, 
the  "unnamed  2  protein"  may  possess  a  unique  character¬ 
istic  and  will  be  of  considerable  interest  for  functional 
comparison  with  other  radixin  proteins. 


Sequence  variability  at  the  C-terminal  domain  of  the  mer¬ 
lin  proteins  of  various  species  appears  to  be  high,  while 
some  conservation  can  be  found  within  separate  taxo¬ 
nomic  groups  such  as  vertebrates,  insects,  and  worms 
(Figure  2  and  Additional  File  1).  A  part  of  the  C-terminal 
region  is  absent  in  Fugu  rubripes,  Danio  rerio,  C.  briggsae, 
and  Brugia  malayi.  This  may  be  due  to  partial  assembly  of 
the  protein  sequences,  as  all  of  them  were  predicted  by 
bioinformatics  using  the  available  genomes  and  cDNA 
sequences.  Alternatively,  the  lack  of  conservation  in  the  C- 
terminal  region  of  merlin  in  these  species  may  imply  that 
this  region  does  not  share  the  same  function.  In  the 
remaining  organisms,  the  C-terminal  amino  acid  residues 
have  a  specific  charge  distribution,  in  spite  of  decreased 
hydrophilicity,  when  compared  with  the  C-terminal  part 
of  moesin  [15];  however,  they  likely  form  structures  simi¬ 
lar  to  the  B,  C,  and  D  helices  found  in  moesin. 

Unlike  ERM  proteins,  two  regions  (residues  1-27  and 
280-323)  in  the  N-terminal  half  of  merlin  have  been 
mapped  that  are  sufficient  for  binding  to  F-actin  [59,60]. 
The  first  17  amino  acids  in  the  N-terminus  of  human  mer¬ 
lin  are  present  in  the  merlin  proteins  of  various  species 
but  not  in  any  ERM  proteins  (see  Additional  Eile  1).  The 
merlin  proteins  of  higher  vertebrates  contain  these  resi¬ 
dues,  eight  of  which  are  absent  in  the  merlin  proteins  of 
other  organisms.  Crystal  stmeture  analysis  suggests  that 
the  structure  of  these  extreme  N-terminal  residues  of  mer¬ 
lin  is  disordered  in  solution  but  likely  becomes  ordered  as 
merlin  binds  to  some  effector  targets  [17].  Our  sequence 
alignment  indicates  that  the  conservation  in  the  extreme 
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N-terminus  of  merlin  extends  to  the  first  27  residues.  The 
distribution  of  specific  positively-charged  residues  also 
appears  to  he  conserved  in  this  N-terminal  portion  of  the 
merlin  proteins  of  lower  vertebrates  and  insects.  These 
results  suggest  that  the  first  27  amino  acids  of  merlin  serve 
as  a  common  protein-binding  motif  It  is  noteworthy  that 
a  similar  sequence  can  be  found  in  the  ERM-like  protein 
of  dona;  however,  the  N-terminal  region  of  the  Ciona  pro¬ 
tein  contains  ten  positively-charged,  basic  amino  acids, 
which  may  affect  the  binding  to  actin  and/or  other  pro¬ 
teins  (Additional  File  1). 

The  internal  actin-binding  site,  containing  residues  280- 
323  in  the  N-terminal  half  of  merlin,  was  found  to  be 
highly  conserved  among  all  merlin  and  ERM  proteins  ana¬ 
lyzed,  particularly  the  last  30  amino  acid  residues  (Addi¬ 
tional  File  1).  This  region  contains  an  extended  helix  at 
the  beginning  of  the  a-helical  domain  and  its  importance 
is  supported  by  the  identification  of  several  disease-caus¬ 
ing  mutations  (S315F,  L316F,  L316W,  Q324L),  which 
were  predicted  to  destabilize  the  a-helical  segment  and 
disrupt  its  hydrogen  bonding  with  subdomain  A  [16-18]. 
In  addition,  these  residues  have  been  shown  to  associate 
with  F-actin  in  moesin  [61,62]  and  to  contribute  to  the 
ICAM-2-binding  site  in  radixin  [14]. 

Previously,  Lajeunesse  et  al.  [63]  identified  seven  func¬ 
tionally  important  amino  acid  residues 
(17oyqmTPEM17^)  in  the  N-terminal  domain  of  Dro¬ 
sophila  merlin,  called  the  "Blue  Box."  These  seven  amino 
acids  are  identical  between  the  human  and  Drosophila 
merlin  proteins  but  differ  from  the  ERM  proteins. 
Sequence  comparison  revealed  a  more  conserved  pattern 
of  the  Blue  Box;  all  seven  amino  acid  residues  of  the  Blue 
Box  were  found  to  be  identical  in  the  merlin  sequences 
from  vertebrates,  fruit  flies,  and  honeybees  (Figure  3); 
however,  several  amino  acid  substitutions  were  found  in 
the  Blue  Box  of  worms,  fish,  and  Ciona.  The  most  interest¬ 
ing  substitutions  were  found  in  the  merlin-like  protein  of 
Caenorhabditis  from  i74xhrProGlui76to  i74SerAlaAspi^®.  It 
is  noteworthy  that  the  methionine  residue  at  position  177 
in  the  Blue  Box  is  conserved  among  all  merlin  proteins 
but  not  in  the  ERM  proteins.  These  results  further  corrob¬ 
orate  the  functional  importance  of  the  seven  amino  acids 
in  the  Blue  Box  [63]. 

According  to  the  crystal  structure  of  the  PERM  domain  in 
human  merlin,  the  Blue  Box  residues  are  located  in  helix 
a3B  of  subdomain  B  [18]  and  form  a  defined  area  that  is 
located  on  the  surface  of  the  protein  [17].  Intriguingly,  the 
three-dimensional  conformation  of  merlin's  Blue  Box 
region  is  similar  to  that  of  the  equivalent  region  in  radixin 
[18],  suggesting  that  regions  in  addition  to  the  Blue  Box 
are  required  for  merlin  to  function  as  a  tumor  suppressor. 
Note  that  regions  closely  adjacent  to  the  Blue  Box-equiva¬ 


lent  residues  in  human  ERM  proteins  have  been  shown  to 
participate  in  the  N-terminal  to  C-terminal  intramolecular 
interaction  and  ligand-binding,  enabling  increased 
mobility  and  structural  changes  in  the  activated  PERM 
domain  [14-16,64].  In  light  of  the  functional  importance 
of  the  Blue  Box  in  Drosophila  merlin,  its  sequence  conser¬ 
vation  during  evolution,  and  its  location  on  the  surface  of 
merlin,  the  Blue  Box  probably  participates  in  specific  pro¬ 
tein-protein  interactions  and  contributes  to  other  activi¬ 
ties  of  merlin. 

As  in  ERM  proteins,  phosphorylation  affects  the  subcellu- 
lar  localization  and  intra-  and  inter-molecular  associa¬ 
tions  of  merlin  [13,30-32].  In  addition,  it  modulates  the 
ability  of  merlin  to  suppress  cell  growth  [34,35].  Two 
phosphorylation  sites  have  been  mapped  to  the  Serais  and 
Thr576  residues  in  the  merlin  protein.  Phosphorylation  on 
SerSis  been  shown  to  modulate  the  ability  of  merlin  to 
form  intramolecular  associations  and  to  bind  to  critical 
effectors  important  for  growth  suppression  [34,35].  In 
contrast,  phosphorylation  on  the  Thr^^e  residue  has  no 
effect  on  merlin's  functional  activity,  while  phosphoryla¬ 
tion  on  this  residue  is  important  for  the  function  of  ERM 
proteins  [57,65-67].  Sequence  alignment  shows  that  the 
SerSis  residue  is  conserved  across  all  merlin  proteins  of 
various  species  with  the  exception  of  the  fruit  fly  and  the 
worm,  which  contain  a  related  threonine  residue  at  the 
corresponding  position  (Figure  4).  Since  both  the  serine 
and  threonine  residues  can  be  phosphorylated,  we  suggest 
that  the  corresponding  threonine  residue  in  merlin  pro¬ 
teins  of  the  fly  and  the  worm  may  act  as  a  phosphoryla¬ 
tion  site. 

Gutmann  et  al.  showed  that  mutations  within  the  pre¬ 
dicted  a-helical  region  of  the  human  merlin  protein  had 
little  effect  on  its  function,  whereas  those  in  its  N-  or  G-ter- 
minus  rendered  the  protein  inactive  as  a  negative  growth 
regulator  [28,29].  Specifically,  five  naturally  occurring 
missense  mutations,  L64P,  K79E,  E106G,  L535P  and 
Q538P,  were  found  to  inactivate  merlin  function.  Interest¬ 
ingly,  we  found  that  the  Leu®'*  and  Lys^^  residues  were  con¬ 
served  among  the  merlin  and  ERM  proteins  of  various 
species  (Figure  4).  According  to  the  crystal  structure  of  the 
FERM  domain  of  merlin,  the  L64P  substitution  would  cre¬ 
ate  a  cavity  in  the  hydrophobic  core  of  subdomain  A  and 
affect  its  P-sheet  structure  [17,18].  The  significance  of  this 
structural  information  was  further  supported  by  the  find¬ 
ing  that  the  L64P  mutation  impaired  the  ability  of  merlin 
to  form  an  intramolecular  complex  between  its  two  N-ter- 
minal  interaction  sites  [28].  Moreover,  the  L64P  mutant 
lost  its  ability  to  bind  the  cytoplasmic  tail  of  CD44;  this 
interaction  correlates  with  the  ability  of  merlin  to  func¬ 
tion  as  a  growth  suppressor  [29]. 
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The  Lys^3  residue  is  situated  at  the  end  of  helix  a4A,  and 
mutation  at  this  residue  (K79E)  may  cause  the  formation 
of  a  salt  bridge  with  its  neighboring  Lys^®  residue,  which  is 
normally  hydrogen  bonded  to  Tyr®®  in  helix  a3A  [17]. 
Two  equivalent  lysine  residues,  Lys®°  and  Lys®^,  were 
found  in  module  FI  of  moesin  and  were  predicted  to  be 
involved  in  specific  protein  interactions,  consequently 
changing  the  structure  of  an  activated  molecule  [15,16]. 
Together  with  a  group  of  positively-charged  amino  acids 
at  the  beginning  of  the  helix  module  F3  (R275,  K278, 
R279),  these  lysine  residues  interact  with  the  negatively- 
charged  residues  (342-RFKFF-346)  in  the  C-terminal 
region  [16].  Importantly,  most  of  the  homologous  posi¬ 
tively-charged  residues  located  between  lobes  FI  and  F3 
in  the  radixin  protein  have  been  shown  to  bind  to  inositol 
1,4,5-trisphosphate  (IP3)  [15].  In  addition  to  Lys^^,  the 
Lys76  residue  was  also  found  to  be  highly  conserved 
among  various  merlin  and  FRM  proteins  with  the  excep¬ 
tion  of  the  worm  protein,  which  has  a  Gln^®  instead  of 
Lys76  (Figure  4  and  Additional  File  1).  Also,  the  FRM-like 
proteins  of  parasites  Taenia  saginata,  Echinococcus  granulo¬ 
sus,  and  Echinococcus  multilocularis  contain  an  Arg^®  resi¬ 
due,  which  is  also  a  basic  amino  acid  residue  and  may  be 
capable  of  participating  in  interactions  similar  to  those  of 
the  corresponding  lysine  residue.  On  the  contrary,  in  the 
1F2  protein  of  Shistosoma  mansoni,  the  position  equivalent 
to  Fys®o  of  moesin  is  occupied  by  a  glutamic  acid  residue, 
and  no  conservation  of  residues  275,  278,  and  279  in  the 
1F2  protein  was  found,  suggesting  a  unique  structural  fea¬ 
ture  for  this  Shistosoma  protein. 

Several  other  naturally-occurring  missense  mutations  on 
human  merlin,  including  F106G,  F535P  and  Q538P,  have 
also  been  found  to  affect  its  functional  activity  [29,68]. 
Our  sequence  alignment  revealed  that  the  Glu*°®,  Leu®^®, 
and  Gin®®®  residues  were  conserved  among  the  merlin 
proteins  of  the  Ghordata  group  (Figure  4),  highlighting 
the  general  importance  of  these  residues  for  merlin  func¬ 
tion.  Similar  to  the  F64P  mutation  described  above,  the 
E106G  mutation  resulted  in  impaired  intramolecular 
associations  of  merlin  [29].  Flowever,  the  Feu®'*  residue  is 
highly  conserved  among  all  merlin  and  FRM  proteins  of 
various  species,  while  Glu*°®  is  conserved  only  in  the  mer¬ 
lin  proteins  of  Chordata  and  worms.  In  the  crystal  struc¬ 
ture  of  the  PERM  domain  of  merlin,  the  Glu*°®  residue  is 
located  at  the  linker  A-B  (al'B)  and  participates  in  the 
inter-subdomain  interaction  by  forming  a  hydrogen  bond 
with  the  Fys^s®  residue  [17,18].  This  interaction  may  ena¬ 
ble  subdomain  B  to  rotate  closer  to  subdomain  C.  Intrigu- 
ingly,  Lys®®®  is  conserved  only  among  the  merlin  proteins 
of  mammals,  chickens,  frogs,  honeybees,  and  mosquitoes 
(Additional  File  1).  In  the  merlin  proteins  of  other  species, 
a  negatively-charged  aspartic  or  glutamic  acid  occupies 
this  position,  except  in  fish.  Instead  of  lysine,  an  arginine 
residue  was  found  in  the  homologous  position  of  all  FRM 


proteins  (e.g.,  Arg^^s  for  moesin),  except  for  the  ERM-like 
proteins  in  parasites  (Additional  File  1).  This  Arg®-'®  resi¬ 
due,  located  at  the  beginning  of  the  helix  of  subdomain 
F3,  lies  in  a  specific  cleft  between  subdomains  FI  and  F3 
with  the  positively-charged  R275,  K278,  and  R279  resi¬ 
dues.  According  to  the  structure  of  radixin,  the  IPj-bind- 
ing  site  is  located  at  this  basic  cleft  [14].  This  region  in  the 
moesin  protein  has  also  been  shown  to  interact  with  its  G- 
terminal  domain  [16]. 

It  should  be  mentioned  that  residues  that  are  conserved  in 
the  merlin  proteins,  but  not  in  the  FRM  proteins,  of  vari¬ 
ous  species  may  be  important  for  elucidating  the  func¬ 
tional  difference  between  the  merlin  and  FRM  proteins. 
We  found  that  the  glutamic  acid  residue  at  position  204  of 
human  merlin  was  conserved  among  all  merlin  proteins 
(Figure  3).  In  contrast,  variable  and  uncharged  amino 
acids  were  found  at  the  corresponding  position  of  the 
FRM  proteins.  Grystal  structure  of  the  PERM  domain  of 
human  merlin  shows  that  the  Glu®°'*  residue  lies  in  the 
beginning  of  helix  a4B  and  is  included  in  a  specific  stretch 
of  amino  acids  in  subdomain  B  [17].  By  sequence  align¬ 
ment  of  human  merlin  and  FRM  proteins,  about  70 
amino  acids,  including  this  specific  stretch  of  residues, 
which  are  unique  to  merlin  but  different  in  FRM  proteins, 
were  identified  (see  Additional  File  1).  The  majority  of 
these  amino  acids  can  be  subdivided  into  three  clusters; 
each  of  them  is  specific  to  its  corresponding  subdomain 
and  is  located  on  the  surface  of  the  protein.  These  results 
suggest  that  these  70  amino  acids  likely  take  part  in  pro¬ 
tein-protein  interactions.  Note  that  the  conserved  stretch 
of  amino  acids  in  subdomain  B  also  includes  the  func¬ 
tionally  important  "Blue  Box"  discussed  above. 

Similarly,  the  isoleucine  residue  at  position  546  was 
found  to  be  conserved  among  the  merlin  proteins  of  vari¬ 
ous  species,  while  a  leucine  residue  was  present  in  the  cor¬ 
responding  position  in  all  FRM  proteins  (Figure  2).  The 
residue  homologous  to  Feu®^®  in  the  G-terminal  domain 
of  moesin  is  located  at  the  end  of  helix  A  with  other 
hydrophobic  residues,  tightly  contacting  the  hydrophobic 
faces  of  helices  B  and  D  of  subdomain  F2  [15].  Although 
the  information  about  such  an  interaction  in  merlin  is 
presently  unavailable,  additional  crystal  structure  analysis 
should  allow  us  to  better  understand  the  importance  of 
this  amino  acid  residue.  In  addition,  it  will  be  interesting 
to  test  whether  mutations  in  the  conserved  amino  acid 
residues  identified  in  this  study  could  affect  protein  func¬ 
tion. 

Predicted  secondary  structure  of  merlin  and  comparative 
analysis  of  the  predicted  a-helical  region 

Turunen  et  al.  previously  reported  that  the  central  region 
of  FRM  proteins  contained  approximately  200  residues 
that  were  predicted  to  be  mostly  a-helical  [19].  To  exam- 
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Figure  5 

The  predicted  secondary  structures  for  the  merlin  proteins  of  various  species.  The  region  with  a  predicted  P-sheet 
structure  is  shaded  in  grey,  while  the  region  with  an  a-helix  structure  is  shaded  in  black.  These  predicted  secondary  structures 
correspond  to  the  crystal  structural  data  [18],  which  are  shown  above  the  alignment  with  the  a-helix  region  indicated  with  a 
thick  black  bar  and  the  P-sheet  region  with  a  thin  black  bar.  The  predicted  a-helical  domain  in  the  central-to-C-terminal  region 
of  merlin  is  marked  with  an  open  bar.  Asterisks  denote  known  domains  of  the  merlin  protein  with  numbers  pointing  to  the  end 
of  truncated  Drosophila  merlin  protein  discussed  in  the  text. "+"  denotes  the  beginning  and  the  end  of  the  predicted  a-helical 
domain.  The  positions  of  specific  residues  in  the  PERM  domain  discussed  in  the  text  are  denoted  by  black  dots  below  the 
aligned  sequences. 
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Figure  6 

Schematic  diagram  of  the  exon  and  intron  structures  for  the  merlin  genes  of  various  species.  The  horizontal  line 
depicts  the  merlin  gene  with  its  size  indicated  in  base  pairs  (bp)  on  the  right.  The  upright  boxes  represent  exons.  The  lengths 
of  the  merlin  mRNA  sequences  available  in  the  database  are  shown  in  nucleotides  (nt)  and  the  lengths  of  the  predicted  merlin 
proteins  are  also  indicated  in  amino  acids  (aa).  The  indicated  human  NF2  mRNA  refers  to  the  longest,  full-length  transcript 
identified,  which  contains  a  long  3'  untranslated  region  [72].  Two  major  NF2  isoforms  I  and  II  are  produced  via  alternative  splic¬ 
ing  and  the  lengths  of  their  protein  products  are  shown  with  that  of  isoform  I  indicated  in  the  parentheses.  It  should  be  noted 
that  Northern  blot  analysis  detected  the  rat  and  mouse  NF2  mRNAs  of  about  4.5  kb,  indicating  that  the  sizes  of  the  rodent 
NF2  mRNAs  shown  here  are  not  full-length.  The  asterisk  (*)  denotes  the  exon-intron  structure  of  Brugia  malayi  predicted  from 
this  study. 


ine  whether  there  was  a  similar  structure  present  in  all 
merlin  proteins,  we  analyzed  21  merlin  sequences  from 
various  organisms  and  predicted  their  secondary  struc¬ 
tures  using  the  IPRED  program  [69] .  The  results  of  such  an 
analysis  for  six  representative  species  are  presented  in  Fig¬ 
ure  5.  The  predicted  locations  of  a-helices  and  jJ-sheets  in 
the  N-terminal  domain  support  the  experimental  findings 
from  the  structural  analysis  of  the  PERM  domain  of 
human  merlin  protein  [18].  In  addition,  a  predicted  a- 
helical  structure  in  the  central-to-C-terminal  region  was 
found  to  be  conserved  among  the  merlin  proteins  of  vari¬ 
ous  species  analyzed.  Previously,  it  was  shown  that  a  trun¬ 
cated  merlin  protein  of  Drosophila,  containing  residues  1- 
600,  lost  the  ability  to  localize  to  the  cytoplasm  and  was 
concentrated  at  the  plasma  membrane  [63].  ITowever,  two 
smaller  truncated  proteins,  consisting  of  residues  1-350 
or  1-375,  were  only  loosely  associated  with  the  plasma 


membrane.  These  results  suggest  that  the  predicted  a-hel- 
ical  region  of  merlin  is  important  for  its  intracellular  local¬ 
ization.  Since  almost  the  entire  a-helical  domain  was 
absent  in  these  truncated  proteins,  we  suggest  that  it  may 
contain  a  determinant  for  membrane  association.  This 
notion  is  further  supported  by  the  observation  that  addi¬ 
tional  truncated  proteins,  containing  residues  1-300  or 
missing  almost  the  entire  a-helical  domain,  were  diffusely 
localized  to  the  cytoplasm. 

In  human  merlin,  the  predicted  a-helical  structure  is  situ¬ 
ated  between  residues  K3 12  and  K478  (Figure  5  and  Addi¬ 
tional  File  1).  The  N-terminal  border  of  this  structure  was 
clearly  recognized  for  21  merlin  sequences  analyzed, 
whereas  the  C-terminal  boundary  could  be  traced  only  up 
to  Urochordata  {Ciona)  and  Nematoda  {Caenorhabditis). 
This  a-helical  domain,  predicted  from  all  21  merlin 
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Figure  7 

The  alignment  of  exons  with  specific  domains  of  merlin  reveals  the  presence  of  homologous  introns.  Boxes  rep¬ 
resent  the  coding  exons  with  numbers  indicated  accordingly.  The  locations  of  the  three  commonly  discussed  domains  are 
marked  with  horizontal  arrows  under  exons.  The  boundaries  among  these  domains  were  defined  according  to  the  human  mer¬ 
lin  protein.  The  asterisk  indicates  that  the  exon  structure  shown  is  common  to  the  merlin  genes  of  all  vertebrate  species  stud¬ 
ied,  including  Homo  sapiens,  Pan  troglodytes,  Canis  familaris,  /Vlus  musculus,  Rattus  norvegicus,  Danio  rerio,  Fugu  rubripes,  and 
Xenopus  laevis.  The  overall  merlin  gene  structure  of  Callus  gallus  is  similar,  except  that  exons  1 5  and  1 6  are  fused  together  in 
this  species. 


sequences,  contains  a  high  density  of  charged  amino  acids 
(from  about  25%  in  Ciona  to  greater  than  40%  in  verte¬ 
brates).  Sequence  alignment  reveals  19  conserved  amino 
acid  residues  in  this  predicted  a-helical  domain  (Addi¬ 
tional  File  1).  The  amino  acid  identity  for  the  predicted  a- 
helical  domain  within  each  phylogenetic  group  is  as  fol¬ 
lows:  1)  Genus  Drosophila  (D.  melanogaster  and  D.  yakuba) 
-  99%  (one  amino  acid  substitution),  2)  Genus 
Caenorhabditis  (C.  elegans,  C.  briggsae,  and  C.  remanie)  - 
85.7%,  3)  Genus  Ciona  (C.  intestinalis  and  C.  savignyi)  - 
71.7%,  4)  vertebrates  -  63.5%,  and  5)  mammals  -  90%. 
Taken  together,  these  results  indicate  that  the  merlin  pro¬ 
teins  of  various  species  contain  a  conserved  a-helical 
domain  in  the  central  to  G-terminal  region. 

Exon-intron  structural  evolution  of  the  merlin  gene 

Recent  progress  in  automated  computational  analysis  of 
partially  and  completely  sequenced  genomes  using  a  gene 
prediction  method  and  the  analysis  of  expressed  sequence 
tag  (EST)  has  provided  considerable  opportunities  not 
only  to  describe  novel  genes  but  also  their  exon-intron 
structures.  Such  an  approach  also  allows  the  examination 
of  the  presence  of  different  splicing  variants/isoforms.  To 
understand  the  evolution  of  the  exon-intron  stmeture  of 
the  merlin  gene,  we  assembled  all  available  NF2  gene- 
related  sequences  from  various  taxa.  Using  the  sequences 
of  proteins,  mRNAs,  and  combined  contigs  [70],  we  estab¬ 
lished  the  stmeture  of  the  merlin-like  gene  for  Brugia 
malayi,  which  consists  of  12  exons  and  11  introns  (Figure 
6).  Analogously,  the  homolog  of  the  NF2  gene  in 
Caenorhabditis  elegans  was  found  to  contain  1 1  exons  and 
10  introns.  It  should  be  mentioned  that  the  two  NF2-like 


sequences,  nfm-la  and  nfm-lb  of  Caenorhabditis  elegans, 
differ  from  each  other  only  by  the  sequence  of  the  last 
exon  (Additional  File  1),  suggesting  that  they  represent 
cDNA  isoforms. 

As  shown  in  Figure  6,  the  general  arrangement  of  the  mer¬ 
lin  gene  stmeture  is  conserved  among  mammalian  spe¬ 
cies,  especially  at  the  region  that  encodes  the  N-terminal 
domain,  albeit  the  length  of  the  genes  may  differ  slightly. 
The  human  NF2  gene  consists  of  17  exons  and  spans 
about  95  kb  of  DNA  [5,6,71,72].  NF2  transcripts  undergo 
alternative  splicing,  generating  multiple  isoforms  [72-79]. 
Isoform  I,  missing  exon  16,  and  isoform  II,  containing  all 
17  exons,  are  the  two  predominant  species.  As  the  result 
of  alternative  splicing,  isoform  1  encodes  a  595  amino 
acid  protein.  Isoform  II  differs  from  isoform  I  only  at  the 
C-terminus.  Insertion  of  exon  16  into  the  mRNA  provides 
a  new  stop  codon,  resulting  in  a  590  amino  acid  protein 
that  is  identical  to  isoform  I  over  the  first  579  residues. 
Because  of  the  presence  of  a  long  3'  untranslated  region, 
the  longest  NF2  isoform  I  RNA,  containing  the  sequence 
from  all  17  exons,  is  about  6.1  kb  [72].  The  merlin  genes 
of  other  mammalian  species,  Rattus  norvegicus,  Canis 
familiaris,  Mus  musculus  and  Pan  troglodytes,  contain  16 
exons  (Figures  6  and  7).  In  addition,  alternatively  spliced 
merlin  isoforms  have  been  found  in  rodent  species  [80- 
82].  On  the  contrary,  the  structure  of  the  merlin  genes  of 
Callus  gallus  and  Fugu  rubripes  are  arranged  differently 
from  those  of  mammalian  species,  with  15  and  14  exons 
spanning  much  shorter  DNAs  of  only  about  25  kb  and 
12.3  kb,  respectively  (Figure  6).  Although  the  NF2  gene  of 
Fugu  rubripes  has  not  yet  been  completely  assembled,  we 
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predict  that  it  lacks  the  sequences  of  the  first  and  the  last 
exons  of  the  mammalian  NF2  gene  based  on  our  sequence 
alignment  (see  Additional  File  1). 

In  spite  of  the  presence  of  1 6  exons  and  the  size  of  tran¬ 
script  similar  to  those  found  in  some  vertebrates,  the  gene 
for  the  merlin-like  protein  of  dona  intestinalis  is  relatively 
small  with  only  about  4.3  kb  (Figure  6).  This  tendency 
towards  reduction  of  intron  length  and  number  continues 
to  be  seen  in  the  merlin  gene  of  worms  and  insects  (Fig¬ 
ures  6  and  7).  The  gene  for  the  merlin-like  protein  of 
Caenorhabditis  elegans,  consisting  of  1 1  exons,  spans  about 
4.7-kb  DNA,  and  that  of  Brugia  malayi,  containing  12 
exons,  is  about  5.5  kb  in  length.  The  NF2  homolog  of  Dro¬ 
sophila  melanogaster  and  the  gene  for  the  merlin-like  pro¬ 
tein  of  Apis  mellifera  are  only  about  2.9  kb,  the  smallest  in 
the  merlin  clade,  and  consist  of  5  and  8  exons,  respectively 
(Figure  6).  Interestingly,  some  conservation  of  the  posi¬ 
tions  of  homologous  introns  can  be  found  in  the  NF2 
gene  from  various  species  (Figure  7);  however,  the  sizes  of 
their  introns  are  variable.  Such  an  evolutionary  conserva¬ 
tion  of  homologous  introns  implies  that  the  presence  of 
regulatory  sequences  in  the  introns  to  regulate  the  tran¬ 
scriptional  event. 

Unlike  the  sizes  and  stmetures  of  the  merlin  or  merlin-like 
genes  in  various  organisms,  the  lengths  of  the  merlin  pro¬ 
teins  and  transcripts  have  not  changed  very  much  during 
evolution  (Figure  6).  Moreover,  several  functionally 
important  domains  of  merlin  also  remain  conserved. 
Since  the  merlin  protein  of  the  insect  emerged  after  devi¬ 
ation  from  that  of  the  worm,  which  was  anciently  derived 
from  the  common  ancestor  (Figure  1),  it  appears  that  the 
decrease  in  gene  size  and  exon  number  occurred  specifi¬ 
cally  within  the  insect  group.  This  branch  of  merlin  evolu¬ 
tion  is  likely  to  develop  independently  and  in  the 
opposite  direction  from  those  more  recently  developed 
merlin  proteins  of  Chordata.  Parallel  evolution  towards 
the  increase  in  merlin  gene  size  and  exon  number 
between  the  worm  and  Chordata  appears  to  be  less  likely. 
Thus,  it  is  possible  that  the  common  ancestor  for  the  mer¬ 
lin  genes  of  the  worm,  the  insect,  and  Chordata  may  con¬ 
tain  a  few  more  exons.  During  evolution,  reduction  of 
introns  and/or  fusion  of  exons  occur  within  the  insect 
group. 

It  is  evident  that  the  genome  of  the  insect  is  more  compli¬ 
cated  than  that  of  the  worm.  The  simplification  of  the 
merlin  gene  stmeture  in  the  insect  is  unique  and  may  have 
a  functional  significance.  This  observation  may  explain 
the  lack  of  splicing  variants  of  the  NF2  homolog  in  Dro¬ 
sophila,  in  contrast  to  those  merlin  isoforms  found  in 
mammals  [72-82]  and  in  Caenorhabditis  elegans  as  we  pre¬ 
dicted  in  this  study. 


Conclusion 

We  have  conducted  the  phylogenetic  analysis  of  merlin 
diversity  across  metazoan  genomes  using  the  experimen¬ 
tally  annotated  and  predicted  sequences  in  conjunction 
with  bioinformatic  tools.  We  show  that  the  merlin  pro¬ 
teins  have  a  monophyletic  origin  with  a  root  in  the  early 
metazoa.  We  have  also  established  the  expansion  of  the 
ERM-like  ancestors  within  the  vertebrate  clade  that 
occurred  after  its  separation  from  Urochordata.  Several 
potentially  important  sites  that  are  conserved  among  all 
merlin  proteins  but  are  divergent  in  the  ERM  members 
have  been  identified.  As  supported  by  the  crystal  stmeture 
data,  these  conserved  residues  are  likely  to  be  important 
for  merlin  function.  Analysis  of  the  evolution  of  the  mer¬ 
lin  gene  stmeture  reveals  the  existence  of  common  splic¬ 
ing  variants  in  human  and  Caenorhabditis  elegans.  While  a 
trend  toward  the  increase  of  gene  length  during  evolution 
was  observed,  a  reduction  of  intron  number  and  length 
appears  to  occur  in  the  merlin  gene  of  the  insect  group. 
Taken  together,  our  results  have  important  implications 
for  the  evolution  of  the  merlin  proteins  and  their  possible 
functional  variability  in  different  taxa. 

Methods 
BLAST  search 

Initial  sequences  of  genes  and  proteins  of  interest  from 
various  organisms  were  identified  from  the  National 
Center  for  Biotechnology  Information  (NCBI)  database 
[83]  using  the  BLAST  algorithm  [84].  We  then  searched 
the  desirable  sequences  across  genomic  databases  of  com¬ 
pletely  or  partially  sequenced  genomes  available  at  The 
Sanger  Institute  [85]  and  The  Institute  for  Genomic 
Research  (TIGR;  [86]).  Also,  we  investigated  other  availa¬ 
ble  sequence  databases  that  contain  information  for  spe¬ 
cific  organisms.  The  sources  of  sequences  of  the  predicted 
or  experimentally  annotated  merlin  and  ERM  proteins  are 
summarized  in  Table  1 . 

To  obtain  the  entire  amino  acid  sequence  of  an  annotated 
protein,  we  used  UniProt  from  Universal  Protein  Resource 
[87].  The  erythrocyte  membrane  proteins  4.1  sequences  of 
Homo  sapiens  (GenBank:  GAI21970).  Mus  musculus  (Gen- 
Bank:  NP  001006665),  and  Danio  rerio  (GenBank: 
AA097985)  were  also  included  in  the  analysis  as  an  out¬ 
group.  Because  of  the  presence  of  many  non-conserved 
and  large  introns  in  the  genes  of  interest,  we  conducted 
BLAST  search  using  TBLASTN  alignment  algorithm  in  the 
cases  where  no  protein  sequences  were  available. 

Alignments  and  phylogeny 

The  GLUSTAL_X  program  [88]  was  used  to  align  the  char¬ 
acterized  or  predicted  protein  sequences  from  different 
species.  Phylogenetic  analysis  was  carried  out  using  the 
MEGA2.1  program  [49]. 
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Secondary  structure  prediction 

The  secondary  structure  prediction  program  JPRED  [69] 
was  used  to  predict  the  secondary  structure  of  each  merlin 
protein  from  various  species.  This  program  uses  physical- 
chemical  properties  of  the  amino  acid  sequence  and  neu¬ 
ral  network  for  recognition  of  a-helices  and  P-sheets. 
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Additional  File  1 

Complete  amino  acid  sequence  alignment  of  the  merlin  and  ERM  pro¬ 
teins.  Letters  shaded  in  grey  illustrate  the  conservation  pattern  of  aligned 
sequences.  The  names  of  the  merlin  proteins  from  various  species  are 
shown  in  yellow.  The  conserved  residues  of  the  'Blue  Box'  are  also  shaded 
in  yellow.  The  positions  of  residues  discussed  in  the  text  are  colored  in  red. 
Blue  letters  denote  the  conserved  residues  within  the  predicted  a-helical 
domain.  Numbers  indicated  at  the  end  of  each  sequence  refer  to  the  posi¬ 
tions  of  the  last  residue  within  each  protein  analyzed. 
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